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EXPRESSION OF PACE IN HOST CELLS AND 
METHODS OF USE THEREOF 

Background of the Invention 

This invention relates generally to the 
production of proteins in recombinant host cells. More 
particularly, it relates to materials and methods for the 
production of mature forms of proteins from heterologous 
precursor polypeptides using a paired basic amino acid 
converting enzyme (PACE) , which is expressed in selected 
host cells. 

Many eukaryotic proteins are naturally 
synthesized as larger precursor polypeptides, which 
require further specific proteolytic processing for full 
maturation prior to secretion. However, many of these 
eukaryotic proteins or precursors when synthesized in 
bacteria fold incorrectly or inefficiently and, 
consequently, exhibit low specific activities. 
Posttranslational proteolysis is frequently required for 
the synthesis of fully biologically active, mature 
proteins and peptides in all eukaryotes examined, 
including yeast [R.S. Fuller et al., Ann. Rev. Phvsiol. , 
50:345 (1988)], invertebrates [R. H. Scheller et al. , 
Cell , 32 ; 7 (1983)], and mammalian cells [J. Douglass et 
al., Ann. Rev. Biochem. . 53.:665 (1984); and W.S. Sossin 
et al., Neuron . 2, 1407 (1989)]. 

One of the early events in precursor protein 
maturation is endoproteolytic cleavage at the carboxyl 
side of paired basic amino acid sequences (e.g., -Lys- 
Arg- and -Arg-Arg-) . This kind of endoproteolytic 
cleavage was initially inferred from the sequences of 
several endocrine and neuroendocrine precursor proteins 
and was first proposed from studies of proinsulin [D.F. 
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Steiner et al., Science . 157:697 (1968); R-E. Chance et 
alw Science . 161:165 (1968)] and the ACTH/6-endorphin 
precursor I proopiomelanocortin (POMC) [M. Chretien and 
C.H. Li, can. J, Biochem. , 45:1163 (1967)]. Subsequent 
studies have revealed a broad spectrum of precursor 
proteins that require endoproteolysis at pairs of basic 
amino acids to yield mature peptides including serum 
factors [A.K. Bentley et al, Cell , 45:343 (1986)], viral 
proteins [CM. Rice et al,. Virology. 151:1 (1986); CM. 
Rice et al., Science , 229:726 (1985); J.M. McCune et al.. 
Cell . 53:55 (1988)], growth factors [L.E. Gentry et al., 
Mol, Cell Biol. . 8:4162 (1988); K. Sharpies et al., DNA, 
6:239 (1987); M. Yanagisawa et al., Nature. 332:411 
(1988); and Gray et al., Nature . 303:722 (1983)] and 
receptors [Y. Yosimasa, Science . 240 :784 (1988)]. See, 
also, Dickerson et al, J. Biol, Chem. . 265:2462 (1990); 
Achsletter et al, EMBO J. . 4:173 (1985); and Mizuno et 
al, Biochem. Biophvs- Res. Commun. . 144 :807 (1987). 

Cleavage at the site of a paired basic amino 
acid sequence removes many propeptides which function in 
a variety of roles in the processing of the mature 
protein. In certain cases the propeptide can mediate 
correct folding and disulfide bond formation within the 
protein sequence. In other cases the presence of the 
propeptide appears to be involved in 7-carboxylation of 
glutamic acid residues in vitamin K-dependent coagulation 
factors. 7-carboxylated proteins include Factor IX and 
Protein C, and certain bone-specific proteins, such as 
bone Gla protein/osteocalcin. The propeptide can also 
direct intracellular targeting and regulate the 
coordinate synthesis of multiple mature peptides from a 
single precursor polypeptide. 

The sequences of the propeptide domains of 
certain vitamin K-dependent blood coagulation proteins 
have been published [See, Furie et al, Cell . 53:505 
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(1988)] and the size of the propeptide has been 
established for both Factor IX and Protein C. Factor IX 
is a zymogen of a serine protease that is an important 
component of the intrinsic pathway of the blood 
coagulation cascade. The protein is synthesized in the 
liver and undergoes extensive co- and post-translational 
modification prior to secretion. These modifications 
involve endoproteolytic processing to remove the pre- and 
pro-peptides ^ glycosylation, vitamin K-dependent 7- 
carboxylation of 12 amino-terminal glutamic acid residues 
and )S-hydroxylation of a single aspartic acid residue. 

The 7-carboxyglutamic acid residues confer 
metal binding properties on the mature Factor IX protein 
and may function similarly in the processing of the other 
vitamin K-dependent blood clotting proteins. These 7- 
carboxyglutamic acid residues are essential for coagulant 
activity. The gamma-carboxyglutamate (GLA) domain of 
Factor IX has also been identified as a major requirement 
for cell binding [Derian et al, J. Biol. Chem, , 
264(12) :6615-6618 (1989)]. 

With the advance of genetic engineering, 
many eiikaryotic proteins are being produced recombinantly 
in selected cell lines. For example, Chinese Hamster 
Ovary (CHO) DUKX cell lines producing recombinant Factor 
IX at high antigen levels (20 Mg/nil/day) have been 
isolated. However, only 1-2% of that recombinant protein 
is 7-carboxylated, and therefore biologically active, in 
the presence of vitamin K3 [Kaufman et al, J. Biol. 
Chem. . 261(21) :9622-28 (1986)]. Additionally, amino- 
teirminal sequencing of the recombinant protein has found 
that 50% of the recombinant Factor IX produced by the CHO 
cells retain the propeptide [Derian et al, J. Biol. 
Chem . . 264(12): 6615-18 (1989) ]• Presumably, the 
endoproteolytic processing enzyme of the CHO cells 
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directing this cleavage was either satiirated or simply 
inefficient in its function. 

Several activities capable of cleaving at 
single or paired basic residues ia vitro have been 
proposed as candidates for authentic mammalian precursor 
endopr oteases . See, for example, Y.P. Loh and H. Gainer, 
in Brain Peptides , D.T. Krieger, M.J. Brownstein, J.B. 
Martin, Eds. (Wiley-Interscience, New York, 1983), pp. Te- 
lle; M. Chretien, et al. in Cell Biol ocrv of the Secretory 
Process (Karger, Basel, Switzerland, 1983), pp. 214-246; 
A.J. Mason, et al.. Nature . 303:300 (1983); P.J. Isackson 
et al., J- cell. Biochem. . 33:65 (1987); I. Lindberg et 
al., J. Neurochem. . 42:1411 (1985); J. A. Cromlish et al., 
J. Biol. Chem. . 261:10850 (1986); K. Docherty et al., J^ 
Biol. Chem. , 259:6041 (1984); T.C. Chang and Y.P. Loh, 
Endocrinology , 114, 2092 (1984); B.P. Noe et al., 
Cell. Biol. , 99:578 (1984); U.P. Loh, J. Biol. Chem. . 
261 :11949 (1986); H.W. Davidson et al., Biochem. J., 
246:279 (1987); P. Gluschankof et al., J. Biol . Chem., 
262 :9615 (1987); C. Clamigrand et al., Biochem. . 26:6018 
(1987); S.O. Brennan and R.J. Peach, FEBS Letters, 
229 :167 (1988); R.S- Fuller et al., Proc. Na tl. Acad. 
Sci. USA , 86:1434 (1989); K. Mizuno et al., Biochem. 
Biophvs. Res. Comm. , 159:305 (1989); I.C. Bathurst et 
al.. Science . 215:348 (1987); and G. Thomas et al., 
Science , 241 :226 (1988)]. 

Despite the fact that these candidate 
activities and other processing enzymes have been 
proposed as being involved in the propeptide processing 
reactions, these endoproteolytic candidates have either 
not been fully characterized or have not been shown to be 
a bona fide precursor cleaving endoprotease in vivo . The 
purification of proprotein cleavage enzymes has been 
hampered by their low levels of activity in mammalian 
tissue and by their meiabrane-associated nature. 
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Purification of these specific proteases has been 
complicated additionally by non-specific cleavage of the 
assay substrates in vitro , and by contaminating proteases 
such as those released from lysosomes. 

The yeast enzyme Kex2/ encoded by the KEX2 
gene, is a membrane-bound, Ca**^**^ -dependent serine protease 
which functions late in the secretory pathway of 
Saccharomyces cerevisiae . The enzyme cleaves the 
polypeptide chains of prepro-killer toxin and prepro-a- 
factor of that microorganism at the paired basic amino 
acid sequences of Lys-Arg and Arg-Arg [D. Julius et al. 
Cell . 32:1075 (1984); D. Julius et al. Cell . 36:309 
(1984); K. Mizuno et al., Biochem. Biophvs. Res. Commun, . 
156:246 (1988); R.S. Fuller et al., Proc. Natl. Acad. 
Sci, USA . 86:1434 (1989)]. Kex-2 has been considered to 
be a prototypic proprotein convertase. 

Recently, co-expression of the yeast KEX2 
gene with POMC in maiamalian BSC-40 cells (a cell line 
which is incapable of processing this peptide precursor) 
reportedly resulted in the generation, by proteolytic 
cleavage at pairs of basic amino acids, of authentic 
neuroendocrine prohormone peptides, including y-LPH and 
6-endorphin [Thomas et al, (1988), cited above]. Foster 
et al. Thrombosis and Haemostasis . 62.:321 (1989) have 
reported that the yeast KEX2 gene product cleaves the 
Protein C precursor to a two-chain form when the yeast 
endoprotease of the KEX2 gene and the wild-type Protein C 
precursor are coexpressed. However, propeptide 
processing and the effect of Kex2 expression have not 
been studied. 

Two human DNA protease sequences, designated 
PC2 and fur , share some structural homology with each 
other and with the KEX2 gene sequence, PC2, a mammalian 
subtilisin-like protease, was identified by amplification 
of a human insulinoma cDNA library by the polymerase 
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chain reaction using KEX2-derived primers. PC2, which 
has been implicated in the endoproteolytic processing of 
prohormones, shares a partial homology to the yeast Kex2 
protease, especially in the putative active site domains 
[Smeekens et al, J- Biol, Chem., 265:2997 (1990)]. To 
date, however, no functional activity has been 
demonstrated for the PC2 clone. 

The availability of the complete Kex2 gene 
sequence also allowed the detection of significant 
homology between the Kex2 protein and "furin", the 
product of the partially characterized human fur gene. 
The fur locus was initially identified by its proximity 
(in the immediate upstream region) to the c-fes/fps 
proto-oncogene [A. J. M. Roebroek et al, EMBO J. . 5:2197 
(1986)]. The complete nucleotide sequence of the putative 
coding region of the fur gene has been reported. Upon 
comparison, the human fur gene product has demonstrated 
structural homology with the subtilisin-type serine 
protease encoded by the KEX2 gene of the yeast 
cerevisiae [A. M. W. van den Ouweland et al, Nucl. Acids 
Res. . 18(3) :664 (1990). This published cDNA coding 
sequence for fur is presented in Figure 1. See, also, 
R.S. Fuller et al. Science . 246 :482 (1989). However, no 
evidence of the expression of fur was reported. 

An expression system has been developed 
which utilizes baculovirus vectors to introduce 
heterologous genes into insect cells in culture and 
subsequently effects the expression of the heterologous 
polypeptide. This has proven successful for the 
recombinant expression of some proteins [see, e.g., G. Ju 
et al., Curr. Communic. in Mol> Biol. - Gene Transfer 
Vectors for Mammalian Cells , C.S.H.L. Press (1987) pps. 
39-45; and A. E. Atkinson et al,, Pestic. Sci. , 28:215- 
224 (1990)]. 
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There remains a need in the art for a method 
of increasing the efficiency of proteolytic processing of 
precursor polypeptides in recombinant host cells. 

5 Summary of the Invention 

In one aspect, the present invention 
provides a selected host cell comprising a recombinant 
polynucleotide encoding PACE, which cell is capable of 
expressing PACE. In various embodiments of this aspect 
10 of the invention, the host cell may be a microorganism, 

e.g., a bacterial or fungal cell, a mammalian cell or an 
insect cell. 

In a further aspect, the invention provides 
a selected host cell comprising a recombinant 

15 polynucleotide encoding PACE and a heterologous 

polynucleotide encoding a selected precursor polypeptide. 
The selected precursor polypeptide is preferably a 
substrate for the encoded PACE. This host cell is 
characterized by the ability to express both PACE and the 

20 heterologous precursor protein, which is then cleaved by 

the co-expressed PACE into its mature form. This host 
cell is thereby capable of producing high levels of PACE 
and the active, mature heterologous protein. In various 
embodiments of this aspect of the invention, the host 

25 cell may be a microorganism, e.g., a bacterial or fungal 

cell, a mammalian cell or an insect cell. 

In another aspect, the present invention 
provides a recombinant expression vector or DNA molecule 
comprising a polynucleotide sequence encoding PACE or a 

30 homolog thereof- The vector preferably provides the 

sequence encoding PACE operably linked to a regulatory 
sequence capable of directing the replication and 
expression of PACE in a selected host cell. 

In still another aspect, the recombinant 

35 expression vector or a DNA molecule of this invention 
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further comprises a polynucleotide sequence encoding a 
precursor polypeptide, which is a substrate for PACE. 
The coding sequences of the vector are operably linked 
with one or more suitable regulatory sequences capable of 
5 directing the replication and expression of PACE and the 

selected propeptide in a selected host cell^ 

In still a further aspect the invention 
provides a method for expressing PACE in a selected host 
cell, described above, which comprises culturing the 

10 selected cell comprising a PACE-encoding polynucleotide 

under conditions suitable for expressing PACE. 

In yet another aspect the invention provides 
a method for expressing PACE and a heterologous 
polypeptide in a selected host cell which comprises 

15 culturing a selected above-described cell comprising a 

PACE polynucleotide and a heterologous polynucleotide 
encoding a selected precursor polypeptide under suitable 
conditions permitting expression of both PACE and the 
heterologous polypeptide. This method may increase the 

20 efficiency of, or otherwise enhance the production of, a 

functional, mature protein, which protein requires 
processing by the enzyme PACE of a pro-peptide form for 
biological activity. The invention may also be used for 
the processing of 7-carboxylated proteins and other 

25 proteins not requiring gamma carboxylation, leading to 

higher levels of biologically active or otherwise useful 
proteins . 

The method may involve transforming a 
selected host cell with the recombinant expression 
30 vectors described above. This cell line is then cultured 

under appropriate conditions permitting expression of the 
recombinant protein (s). The expressed selected 
protein (s) is then harvested from the host cell or 
culture medium by suitable conventional means. 
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Other aspects and advantages of this 
invention are apparent from the following detailed 
description of the invention. 
Description of the Dravino 

Figure 1 illustrates the published fur DNA 
sequence of A.M. W. van den Ouweland et al, Nucl. Acids 
Res. , 18(3) :664 (1990). 

Figure 2 illustrates the composite cDNA 
secfuence encoding PACE, and the amino acids encoded 
therein, which differs from the above Figure 1 in the 
inclusion of the 5 ' untranslated region from nucleotide 
#-320 to -1, and the 3* untranslated region from 
nucleotide #2383 to 3974. 
Detailed Description of the Invention 

The present invention includes compositions 
(e.g., vectors, transformed host cells, recombinant 
polypeptides) and methods for producing, expressing, and 
also secreting, in selected host cells a mammalian 
endopeptidase, PACE, which is involved in the production 
Of mature polypeptides from precursor polypeptides by 
cleavage at pairs of basic amino acids (-LysArg-, 
-LysLys-, and -ArgArg-) . The compositions of the present 
invention, e.g., the recombinant polynucleotides, can be 
used for enhanced intracellular or extracellular 
production of PACE in various host cells, including 
microorganisms, e.g, bacteria and fungi; insect cells and 
mammalian cells. The production of PACE in these 
expression systems provides another embodiment of this 
invention, methods for the efficient processing and 
conversion of co-expressed heterologous precursor 
polypeptides having processing sites recognized by the 
PACE endopeptidase to desired mature forms of those 
polypeptides. The compositions of this invention are 
also useful for the production of the endopeptidase in 
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high yields for production of purified endopeptidase for 
commercial purposes. 

The practice of the present invention 
employs, unless otherwise indicated, conventional 
techniques of molecular biology, microbiology, 
recombinant DNA, and immunology, which are within the 
skill of the art. Such techniques are explained fully in 
the literature. See, e.g., Sambrook, et al. , "Molecular 
Cloning; A Laboratory Manual^', 2nd ed. (1989); "DNA 
Cloning", Vols. I and II (D,N Glover ed. 1985); 
"Oligonucleotide Synthesis" (M.J. Gait ed. 1984); 
"Nucleic Acid Hybridization" (B.D. Hames & S.J. Higgins 
eds. 1984); "Transcription and Translation" (B.D. Hames & 
S.J. Higgins eds. 1984); "Animal Cell Culture" (R.I. 
Freshney ed. 1986); "Immobilized Cells and Enzymes" (IRL 
Press, 1986); B. Perbal, "A Practical Guide to Molecular 
Cloning" (1984); the series. Methods in Enzvmolocrv 
(Academic Press, Inc.), particularly Vols. 154 and 155 
(Wu and Grossman, and Wu, eds., respectively); "Gene 
Transfer Vectors for Mammalian Cells" (J.H. Miller and 
M.P. Calos eds. 1987, Cold Spring Harbor Laboratory); 
"Immunochemical Methods in Cell and Molecular Biology", 
Mayer and Walker, eds. (Academic Press, London, 1987); 
Scopes, "Protein Purification: Principles and Practice", 
2nd ed. 1987 (Springer-Verlag, N.Y,); and "Handbook of 
Experimental Immunology", Vols. I-IV (D.M. Weir and C. C. 
Blackwell eds 1986). All patents, patent applications, 
and publications cited in the background and 
specification are incorporated herein by reference. 

The following definitions may be applied to 
terms employed in the description of embodiments of the 
invention. As used herein, the term "PACE" is an acronym 
for paired basic amino acid converting (or cleaving) 
enzyme. PACE, originally isolated from a human liver 
cell line, is a subtilisin-like endopeptidase, i.e., a 
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propeptide-cleaving enzyme which exhibits specificity for 
cleavage at basic residues of a polypeptide, e.g., -Lys- 
Arg-, -Arg-Arg, or -Lys-Lys-. PACE is stimulated by 
calcium ions; and inhibited by phenylmethyl sulfonyl 
5 fluoride (PMSF) . A DNA sequence encoding PACE (or fur in) 

was published in A.M.W. van den Ouweland et al, cited 
above, and appears in Figure 1. 

A cDNA encoding at least one form of PACE, 
derived from an animal cell, more specifically from a 

10 human cell, is presented in Figure 2. It is anticipated 

that other forms of PACE exist or that they can be 
created. PACE, as described herein, may be encoded by 
DNA sequences that differ in sequence from the published 
sequence and the sequence of Figure 2 due to natural 

15 allelic or species variations. Thus, the term "PACE" 

refers to any of the naturally occurring forms of PACE, 
including the PACE precursor shown in Figure 2 and 
various processed forms, including the mature PACE 
polypeptide. 

20 Similarly the term PACE may include 

fragments of the PACE DNA and amino acid sequences or 
deliberately modified sequences thereof that maintain the 
catalytic specificity of that enzyme. Therefore, 
provided that the biological activities of mediating 

25 propeptide cleavage and/ or 7-carboxylation are retained 

in whole or part despite such modifications, this 
invention encompasses the use of all such DNA sequences. 
The term "PACE" as used herein thus encompasses the 
peptide and DNA sequences specifically disclosed herein 

3 0 as well as analogs thereof retaining PACE biological 

activity. 

Analogs of PACE included within the 
definition may include truncated polypeptides (including 
fragments) and PACE-like polypeptides, e.g., mutants, 
35 that retain catalytic activity and preferably have a 
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homology to Figure 1 or 2 of at least 80%, more 
preferably 90%, and most preferably 95%. Typically, such 
analogs differ by only 1, 2, 3, or 4 codon changes. 
Examples include polypeptides with minor amino acid 
5 variations from the natural amino acid secpience of PACE; 

in particular, conservative amino acid replacements. 
Conservative replacements are those that take place 
within a family of amino acids that are related in their 
side chains. Genetically encoded amino acids are 

10 generally divided into four families: (1) acidic = 

aspartate, glutamate; (2) basic = lysine, arginine, 
histidine; (3) non-polar = alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, 
tryptophan; and (4) uncharged polar = glycine, 

15 asparagine, glutamine, cystine, serine, threonine, 

tyrosine. Phenylalanine, tryptophan, and tyrosine are 
sometimes classified jointly as aromatic amino acids. 
For example, it is reasonable to expect that an isolated 
replacement of a leucine with an isoleucine or valine, an 

20 aspartate with a glutamate, a threonine with a serine, or 

a similar conservative replacement of an amino acid with 
a structurally related amino acid will not have a major 
effect on the enzymatic activity, especially if the 
replacement does not involve an amino acid at the active 

25 site of the PACE-like polypeptide. 

Utilizing the sequence data in Figure 2, as 
well as the denoted characteristics of PACE, it is within 
the skill of the art to obtain other DNA sequences 
encoding PACE. For example, the structural gene may be 

30 manipulated by varying individual nucleotides, while 

retaining the correct amino acid(s), or varying the 
nucleotides, so as to modify the amino acids, without 
loss of enzymatic activity. Nucleotides may be 
substituted, inserted, or deleted by known techniques, 
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including, for example, in vitro mutagenesis and primer 
repair. 

The structural gene may be truncated at its 
3 '-terminus and/or its 5 '-terminus while retaining its 
endopeptidase activity. For example, PACE as encoded in 
Figure 2 contains a putative transmembrane domain which 
may serve to anchor it in the membranes of the Golgi in 
the cell in which it is expressed. Additionally, it may 
be desirable to delete the transmembrane (TM) region 
and/or the cysteine-rich region (CRR) . It also may be 
desirable to remove the region encoding the signal 
sequence, and/ or to replace it with a heterologous 
sequence. 

It may also be desirable to ligate a portion 
of the PACE sequence (particularly that which includes 
the catalytic domain) to a heterologous coding sequence, 
and thus to create a fusion peptide with the enzymatic 
specificity of PACE. 

In addition to the above, other open reading 
frames (ORFs) or structural genes encoding PACE may be 
obtained and/or created from cDNA libraries from other 
animal cell sources. 

As used herein, the term "polypeptide" 
refers to a polymer of amino acids and does not refer to 
a specific length of the product; thus, peptides, 
oligopeptides, and proteins are included within the 
definition of polypeptide. This term also does not refer 
to or exclude post-expression modifications of the 
polypeptide, for example, glycosylations, acetylations, 
phosphorylations and the like. Included within the 
definition are, for example, polypeptides containing one 
or more analogs of an amino acid (including, for example, 
unnatural amino acids, etc.), polypeptides with 
substituted linkages, as well as other modifications 
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known in the art, both naturally occurring and 
non-naturally occurring. 

The term "precursor polypeptide" denotes an 
expressed polypeptide which normally undergoes one or 
5 more posttranslational proteolytic cleavages to yield a 

biologically active mature polypeptide. Included within 
the term "precursor polypeptide" are "prepr ©polypeptides" 
and "propolypeptides. " 

A "prepeptide" is the portion of a precursor 

10 polypeptide which is removed by "signal peptidase" 

cleavage during translocation of the polypeptide into the 
endoplasmic reticulum. The "prepeptide" region is 
usually at or near the amino terminus. 

A "propeptide" is the portion of a precursor 

15 polypeptide which is removed by a "propolypeptide 

convertase" or "endopeptidase" (for example, Kex2 and 
PACE) during the maturation process of the polypeptide. 
Many proteins, such as plasma proteins, hormones, 
neuropeptides, and growth factors, are translated with an 

20 additional "propeptide" region located to the carboxy 

side of the prepeptide region. After cleavage of the 
prepeptide, the "propeptide" segment is cleaved by a 
site-specific endopeptidase contributing to the 
maturation of the polypeptide. A "mature" form of a 

25 polypeptide has had a prepeptide and/or propeptide region 

removed. 

A polypeptide or amino acid secfuence 
"derived from" a designated nucleic acid sequence refers 
to a polypeptide having an amino acid sequence identical 

30 to that of a polypeptide encoded in the sequence, or a 

portion thereof wherein the portion consists of at least 
3-5 amino acids, and more preferably at least 8-10 amino 
acids, and even more preferably at least 11-15 amino 
acids, or which is immunologically identifiable with a 

35 polypeptide encoded in the sequence. This terminology 
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also includes a polypeptide expressed from a designated 
nucleic acid sequence. 

A recoiabinant or derived polypeptide is not 
necessarily translated from a designated nucleic acid 
5 sequence, for example, the sequence in Figure 2. It may 

be generated in any manner, including for example, 
chemical synthesis, or expression of a recombinant 
expression system, or isolation from a cell. A 
recombinant or derived polypeptide may include one or 

10 more analogs of amino acids or unnatural amino acids in 

its sequence. Methods of inserting analogs of amino 
acids into a sequence are known in the art. It also may 
include one or more labels, which are known to those of 
skill in the art. 

15 The tera "recombinant polynucleotide" as 

used herein intends a polynucleotide of genomic, cDNA, 
semisynthetic, or synthetic origin which, by virtue of 
its origin or manipulation: (1) is not associated with 
all or a portion of a polynucleotide with which it is 

20 associated in nature, (2) is linked to a polynucleotide 

other than that to which it is linked in nature, or (3) 
does not occur in nature. 

The term "polynucleotide" as used herein 
refers to a polymeric form of nucleotides of any length, 

25 either ribonucleotides or deoxyribonucleotides . This 

term refers only to the primary structure of the 
molecule. Thus, this term includes double- and 
single-stranded DNA and RNA. It also includes known 
types of modifications, for example, labels which are 

30 known in the art, methylation, "caps", substitution of 

one or more of the naturally occurring nucleotides with 
an analog. Other known modifications include 
internucleotide modifications, for example, those with 
uncharged linkages (methyl phosphonates, phospho- 

35 triesters, phosphoamidates, carbamates, etc.) and with 
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charged linkages (phosphorothioates, phosphorodithioates, 
etc.)f those containing pendant moieties, such as, 
proteins {including nucleases, toxins, antibodies, signal 
peptides, poly-L-lysine, etc.), those with intercalators 
(acridine, psoralen, etc.)/ those containing chelators 
(metals^ radioactive metals, boron, oxidative metals, 
etc.)/ those containing alkylators, those with modified 
linkages (alpha anomeric nucleic acids, etc.), as well as 
unmodified forms of the polynucleotide. 

A "replicon" is any genetic element that 
behaves as an autonomous unit of polynucleotide replica- 
tion within a cell, that is, capable of replication under 
its own control. Thus a replicon may include, without 
limitation, a selectable marker, a plasmid, a chromosome, 
a virus, a cosmid. 

A "vector" is a replicon in which another 
polynucleotide segment is attached, so as to bring about 
the replication and/or expression of the attached 
segment. 

A "Control sequence" or "Regulatory 
sequence" refers to polynucleotide sequences which are 
necessary to effect the replication and expression of 
coding sequences to which they are ligated. The nature 
of such control sequences differs depending upon the host 
organism. In prokaryotes, such control sequences 
generally include promoter, ribosomal binding site, and 
transcription termination sequences. In eukaryotes, 
generally, such control sequences include promoters and 
transcription termination sequences. The term "control 
sequences" is intended to include, at a minimum, all 
components whose presence is necessary for expression in 
a selected host cell, and may also include additional 
components whose presence is advantageous, for example, 
leader sequences and fusion partner sequences. 
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"Operably linked", or related terms such as 
"operative association", refer to the relationship 
between the components so described which permits them to 
function in their intended manner. A control sequence 
"operably linked" to a coding sequence is ligated in such 
a way that expression of the coding sequence is achieved 
under conditions compatible with the control sequences. 

An "open reading frame" (ORF) is a region of 
a polynucleotide sequence which encodes a polypeptide. 
This region may represent a portion of a coding sequence 
or a total coding sequence. 

A "coding sequence" is a polynucleotide 
sequence which is translated into a polypeptide, usually 
via mRNA, when placed under the control of appropriate 
regulatory sequences. The boundaries of the coding 
sequence are determined by a translation start codon at 
the 5 '-terminus and a translation stop codon at the 
3 '-terminus. A coding sequence can include, but is not 
limited to, cDNA, and recombinant polynucleotide 
sequences . 

"PGR" refers to the technique of polymerase 
chain reaction as described in Saiki et al.. Nature . 
324:163 (1986); U.S. Patent No. 4,683,195; and U.S. 
Patent No. 4,683,202. Other known PGR modifications are 
also included by use of this acronym. 

As used herein, x is "heterologous" with 
respect to y if x is not naturally associated with y in 
the identical manner; i.e., x is not associated with y in 
nature or x is not associated with y in the same manner 
as is found in nature. 

"Recombinant host cells", "host cells", 
"cells", "cell lines", "cell cultures", and other such 
terms denote selected host cells, e.g., mammalian, insect 
or microorganism cells, that can be, or have been, used 
as recipients for a recombinant vector or other transfer 
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DNA, These terms include the progeny of the original 
cell which has been transformed. It is understood that 
the progeny of a single parental cell may not necessarily 
be completely identical in morphology or in genomic or 
5 total DNA complement as the original parent, due to 

natural, accidental, or deliberate mutation. 

As used herein, the term "microorganism" 
includes prokaryotic and eukaryotic microbial species 
such as bacteria and fungi. Fungi include yeast and 

10 filamentous fxangi. The term "microorganism" specifically 

excludes mammalian cells and insect cells. 

"Mammalian cells" are cells that are from a 
member of the Class Mammalia, and specifically exclude 
microorganism cells and insect cells. 

15 Insect cells and compatible vectors which 

are useful as recombinant expression systems are known in 
the art. Examples include insect expression and transfer 
vectors derived from the baculovirus Autoarapha 
californica nuclear polyhedrosis virus (hereinafter 

20 "AcNPV" or "baculovirus") , which is a helper-independent, 

viral expression vector. Viral expression vectors 
derived from this system usually use the strong viral 
polyhedrin gene promoter to drive expression of 
heterologous genes. 

25 "Transformation", as used herein, refers to 

the insertion of an exogenous polynucleotide into a host 
cell, irrespective of the method used for the insertion. 
Examples include direct uptake, transf ection, f -mating, 
transduction, infection or electroporation. The 

30 exogenous polynucleotide may be maintained as a 

non-integrated vector, for example, a plasmid, or 
alternatively, may be integrated into the host genome. 

The inventors have discovered that the 
enzyme PACE may be recombinantly expressed in a variety 

35 of host cells, including mammalian cells, microorganisms 
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and insect cells. One method of this invention employs a 
single transformed host cell expressing PACE. A 
polynucleotide sequence encoding PACE or a biologically 
active fragment thereof may be inserted into an 
expression vector and operably linked to expression 
control sequences suitable for expression of the enzyme 
in the selected host cell. Transformation or 
transfection of the vector into the selected host cell 
can be effected using materials and methods conventional 
for introducing polynucleotides into a host cell. Among 
such methods are packaging the polynucleotide in a virus 
and transducing a host cell with the virus or by 
transfection procedures known in the art, as exemplified 
by U.S. Patent Nos. 4,399,216; 4,912,040; 4,740,461; 
4,959,455 (these patents are incorporated herein by 
reference) . The transformation procedure used depends 
upon the host to be transformed. Once the vector is 
transformed into the selected host cell, the cell is 
cultured to express PACE. 

In order to obtain PACE expression, 
recombinant host cells derived from the transf ormants are 
incubated under conditions which allow expression of the 
recombinant PACE encoding sequence. These conditions 
will vary, dependent upon the host cell selected. 
However, the conditions are readily ascertainable to 
those of ordinary skill and knowledge in the art. 

Detection of PACE expressed in the 
transformed host cell may be by several methods. For 
example, detection can be by enzymatic activity (or 
increased enzymatic activity or increased longevity of 
enzymatic activity) using fluorogenic substrates which 
are comprised of a dibasic cleavage site for which PACE 
is specific. PACE may also be detected by its 
immunological reactivity with anti-PACE antibodies. 
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PACE may be isolated from the cell by lysis, 
if formed intracellular ly, or isolated from the culture 
medium, if secreted, by conventional methods. If the 
transmembrane domain is retained during expression so 
5 that the PACE localizes in the host cell membranes, the 

host cells may be lysed and the membrane fragments 
isolated by conventional techniques. These fragments 
containing enriched amounts of PACE may be used as is, or 
fixed to a solid substrate for use in processing 

10 precursor polypeptides. The cell membranes may be 

dispersed in a medium at optimal pH, or particle bound 
membrane may be packed in a column. Other useful 
configurations may also be employed. 

Recombinantly expressed PACE can improve the 

15 efficiency of cleavage of a precursor polypeptide between 

the dibasic residues Lys-Arg, Lys-Lys or Arg-Arg into its 
mature form. Thus another embodiment of this invention 
is provided by the action of recombinantly-expressed PACE 
on selected precursor polypeptides, either recombinant or 

20 naturally occurring. The expressed precursor will be one 

which has a processing site recognized by PACE. 

As one example, the recombinantly-expressed 
PACE may be used for the in vitro conversion of 
heterologous precursor polypeptides to mature 

25 polypeptides. Soluble recombinant PACE, i.e., a 

truncated PACE polypeptide lacking a transmembrane 
domain, may be used as an added reagent to extracellular 
(or conditioned) media where a precursor product is 
secreted from the cell in which it is expressed. 

3 0 More preferably, the co-expression of PACE 

and a proprotein which requires such processing for 
production of the mature protein is an embodiment of this 
invention, which can result in high level expression of 
the mature protein. Additionally, the inventors have 

35 also surprisingly discovered that co-expression of PACE 
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with proteins requiring 7-carboxylation for biological 
activity permits the expression of increased yields of 
functional, biologically active mature proteins in 
eukaryotic, preferably mammalian, cells. 
5 Examples of precursor polypeptides for use 

in the present invention include, but are not limited to, 
transforming growth factor (TGF) beta and its 
superfamily, including inhibin and activin; bone 
morphogenic proteins (BMP) ; insulin and relaxin; 

10 coagulation factors, such as von Willebrand factor (vWF) ; 

Factor IX, Protein C, Protein S, Prothrombin Factor 10, 
Factor VII and bone gamma-carboxyglutamate protein, 
growth factors, such as platelet derived growth factor 
(PDGF) and nerve growth factor (NGF) ; and virus 

15 polypeptides, including those from cytomegalovirus (CMV) , 

hepatitis delta virus (HDV) , hepatitis C virus (HCV) , 
human immunodeficiency virus (HIV) , and herpes simplex 
virus (HSV) . Any precursor polypeptide with at least one 
dibasic cleavage site is a candidate for the present 

20 invention. 

Methods for producing a desired mature 
polypeptide by co-expression with PACE can include the 
following techniques- First, a single vector containing 
coding sequences for both PACE and the heterologous 

25 precursor polypeptide can be inserted into a selected 

host cell. Alternatively, two separate vectors coding, 
respectively, for PACE and the heterologous precursor 
polypeptide, can be inserted into a host. Upon culturing 
under suitable conditions for the selected host cell, the 

3 0 two polypeptides are produced and interact to provide 

cleavage of the proprotein into the mature protein. 

Another alternative is the use of two 
transformed host cells wherein one host cell expresses 
soluble recombinant PACE and the other host cell 

35 expresses the heterologous precursor polypeptide which 
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will be secreted into the medium. These host cells can 
be co-cultured under conditions which allow expression 
and secretion or release of the recombinant PACE, as well 
as expression, secretion or release of the precursor 
5 polypeptide, and its cleavage into the mature form by the 

extracellular PACE. In this method, it is preferred that 
the PACE polypeptide lacks the transmembrane domain so 
that it secretes into the medium. 

In some instances, it may be desirable to 

10 have a plurality of copies, two or more, of the gene 

expressing the expression product precursor in relation 
to the PACE gene, or vice versa. This can be achieved in 
a variety of ways. For example, one may use separate 
vectors or plasmids, where the vector containing the PACE 

15 encoding polynucleotide has a higher copy number than the 

vector containing the polynucleotide sequence encoding 
the heterologous precursor polypeptide, or vice versa. 
In this situation, it would be desireJole to have 
different markers on the two plasmids, so as to ensure 

20 the continued maintenance of the plasmids in the host. 

Alternatively, one or both genes could be integrated into 
the host genome, and one of the genes could be associated 
with an amplifying gene, (e.g., dhfr or one of the 
metallothionein genes) . 

25 Alternatively, one could employ two 

transcriptional regulatory regions having different rates 
of transcriptional initiation, providing for the enhanced 
expression of either the PACE gene or the expression of 
the precursor polypeptide, relative to the other gene. 

30 As another alternative, one can use different promoters, 

where one promoter provides for a low level of 
constitutive expression of either PACE or the precursor 
polypeptide, while the second promoter provides for a 
high level of induced expression of the other product. A 

35 wide variety of promoters are known for the selected host 
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cells, and can be readily selected and employed in the 
invention by one of skill in the art. 

By use of these methods, the natural level 
of PACE may be greatly enhanced and/or the longevity of 
protease activity may be increased, so as to more 
efficiently process the expression product precursor. 

A, Mammalian Expression of PACE 

The methods of the present invention may be 
performed by inserting a polynucleotide sequence encoding 
PACE or a fragment thereof into a suitable mammalian 
expression vector. The vector containing PACE is then 
transformed into a selected mammalian cell line. The 
establishment of cell lines which express PACE provides a 
convenient and efficient mechanism for the high level 
production of PACE, as well as for the production of more 
completely processed and biologically active proteins. 

Where the method involves the co-expression 
of PACE and a precursor polypeptide, a single vector can 
carry the PACE DNA and another vector can carry the 
selected precursor DNA, each under the control of a 
selected expression control sequence. Alternatively, 
both the PACE and precursor DNA sequences may be carried 
on a single recombinant vector molecule in which case 
they may be operably linked to respective expression 
control sequences or may share a common expression 
control sequence. As another alternative, a vector 
containing the PACE DNA may be transfected into a host 
cell line known to express the desired proprotein, or a 
vector containing the DNA for the desired protein may be 
transfected into a cell known to express PACE. 

Vector construction employs techniques which 
are known in the art. Site-specific DNA cleavage 
involved in such construction is performed by treating 
with suitable restriction enzymes under conditions which 
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generally are specified by the manufacturer of these 
commercially available enzymes. 

A suitable expression vector is one that is 
compatible with the desired function (e.g., transient 
5 expression, long term expression, integration, 

replication, amplification) and in which the control 
elements are compatible with the host cell. In general, 
the vectors employed will contain selected regulatory 
sequences operably linked with the DNA coding sequences 
10 of PACE and selected precursor and capable of directing 

the replication and expression thereof in selected host 
cells. 

Vectors suitable for replication in 
mammalian cells may include viral replicons, or sequences 

15 that ensure integration of the sequence encoding PACE 

into the host genome. Suitable vectors may include, for 
example, those derived from simian virus SV40, 
retroviruses, bovine papilloma virus, vaccinia virus, and 
adenovirus. The components of the vectors, e.g. 

20 replicons, selection genes, enhancers, promoters, and the 

like, may be obtained from natural sources or synthesized 
by known procedures. [See, Kaufman et al, J. Mol, Biol., 
159 :511*521 (1982); and Kaufman, Proc. Natl. Acad. Sci. , 
USA , 82:689-693 (1985)]. 

25 A suitable vector, for example, is one 

derived from vaccinia viruses. In this case, the 
heterologous DNA is inserted into the vaccinia genome. 
Techniques for the insertion of foreign DNA into the 
vaccinia virus genome are known in the art, and utilize, 

3 0 for example, homologous recombination. The insertion of 

the heterologous DNA is generally into a gene which is 
non-essential in nature, for example, the thymidine 
kinase gene (tk) , which also provides a selectable 
marker. Plasmid shuttle vectors that greatly facilitate 

35 the construction of recombinant viruses have been 
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described [see, for example, Mackett et al. (1984), 
Chakrabarti et al. (1985); Moss (1987)]. Expression of 
the heterologous polypeptide then occurs in cells or 
individuals which are immunized with the live recombinant 
vaccinia virus. 

Suitable mammalian expression vectors 
usually contain one or more eukaryotic transcription 
units that are capable of expression in mammalian cells. 
The transcription unit is comprised of at least a 
promoter element to mediate transcription of foreign DNA 
sequences. Suitable promoters for mammalian cells are 
known in the art and include viral promoters such as that 
from simian virus 40 (SV4 0) , cytomegalovirus (CMV) , Rous 
sarcoma virus (RSV) , adenovirus (ADV) , and bovine 
papilloma virus (BPV) . 

In addition, the transcription unit may also 
be comprised of a termination sequence and poly (A) 
addition sequences which are operably linked to the PACE 
and/or precursor coding sequence (s) . The transcription 
unit may also be comprised of an enhancer sequence which 
increases the expression of PACE and/ or the precursor. 

The optional presence of an enhancer element 
(enhancer) , combined with the promoter elements described 
above, will typically increase expression levels. An 
enhancer is any regulatory DNA sequence that can 
stimulate transcription up to 1000-fold when linked to 
endogenous or heterologous promoters, with synthesis 
beginning at the normal mRNA start site. Enhancers are 
also active when they are placed upstream or downstream 
from the transcription initiation site, in either normal 
or flipped orientation, or at a distance of more than 
1000 nucleotides from the promoter [Maniatis et al . 
Science . 236 : 1237 (1987); Alberts et al.. Molecular 
Biology of the Cell . 2nd ed. (1989)]. Enhancer elements 
derived from viruses may be particularly useful, because 



SUBSTITUTE SHEET 



wo 92/09698 



PCT/US91/08725 



26 

they typically have a broader host range. Examples 
include the SV40 early gene enhancer [Dijkema et al, EMBO 
J, ^ 4:761 (1985)] and the enhancer/promoters derived from 
the long terminal repeat (LTR) of the Rous Sarcoma Virus 
5 [Gorman et al. , Proc, Natl - Anad. Sci. 79;6777 (1982b)] 

and from human cytomegalovirus [Boshart et al.. C ell , 
41:521 (1985)]. Additionally, some enhancers are 
regulatable and become active only in the presence of an 
inducer, such as a hormone or metal ion [Sassone-Corsi 

10 and Borelli, Trends Genet. 2 :215 (1986); Maniatis et al. 

Science , 216:1237 (1987)]. 

Sequences which cause amplification of the 
gene may also be desirable, as are sequences which encode 
selectable markers. Selectable markers for mammalian 

15 cells are known in the art, and include for example, 

thymidine kinase, dihydrof olate reductase (together with 
methotrexate as a DHFR amplifier) , aminoglycoside 
phosphotransferase, hygromycin B phosphotransferase, 
asparagine synthetase, adenosine deaminase, 

20 metallothionien, and antibiotic resistant genes such as 

neomycin. 

Alternatively, the vector DNA may include 
all or part of the bovine papilloma virus genome [Lusky 
et al. Cell . 36:391-401 (1984)] and be carried in cell 
25 lines such as C127 mouse cells as a stable episomal 

element. 

The vector used in the examples below is 
pMT3, a derivative of the previously described vector 
pMT2 [R. Kaufman, MoT. Cell, Biol. , 9:946-958 (1989)]. 

30 One skilled in the art can also construct other mammalian 

expression vectors comparable to the pMT3/PACE vector 
(see Example 1) by, e.g. inserting the DNA sequence of 
PACE from pMT3 into another vector, such as pJL3, pJL4 
[Gough et al., EMBO J. , 4:645-653 (1985)], employing 

35 well-known recombinant genetic engineering techniques. 
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The mammalian cell expression vectors described herein 
may be synthesized by techniques well known to those 
skilled in this art. Other appropriate expression 
vectors of which numerous types are known in the art for 
5 mammalian expression can also be used for this purpose. 

One or more selected vector (s) encoding PACE 
and/or the precursor polypeptide can be used for 
transformation of a suitable mammalian host cell. 
Methods for introduction of heterologous polynucleotides 

10 into mammalian cells are known in the art and include 

dextran-mediated transf ection, calcium phosphate 
precipitation, polybrene mediated transf ection, 
protoplast fusion, electroporation, encapsulation of the 
polynucleotide (s) in liposomes, and direct microinjection 

15 of the DNA into nuclei. 

Mammalian cell lines available as hosts for 
expression are known in the art and include many 
immortalized cell lines available from the American Type 
Culture Collection (ATCC) . Exemplary mammalian host 

2 0 cells include particularly primate cell lines and rodent 

cell lines, including transformed cell lines. Preferably 
for stable integration of the vector DNA, and for 
subsequent amplification of the integrated vector DNA, 
both by conventional methods, Chinese hamster ovary (CHO) 

25 cells are employed as a mammalian host cell of choice. 

Other suitable cell lines include, but are not limited 
to, HeLa cells, baby hamster kidney (BHK) cells, monkey 
kidney cells (COS-1) , human hepatocellular carcinoma 
cells (e.g.. Hep G2), human adenovirus transformed 293 

30 cells, mouse L-929 cells, HaK hamster cell lines, murine 

3T3 cells derived from Swiss, Balb-c or NIH mice and a 
number of other cell lines. Another suitable mammalian 
cell line is the CV-1 cell line. Normal diploid cells, 
cell strains derived from in vitro culture of primary 

35 tissue, as well as primary explants, are also suitable. 



SUB3TITUT!? SHFFT 



wo 92/09698 



PCT/US91/08725 



Candidate cells may be genotypically deficient in the 
selection gene, or may contain a dominant ly acting 
selection gene. 

The selection of suitable mammalian host 
5 cells and methods for transformation, culture, 

amplification, screening and product production and 
purification are known in the art. See, e.g., Gething 
and Sambrook, Nature , 293:620-625 (1981), or 
alternatively, Kaufman et al, Mol, Cell. Biol., 
10 5(7) :1750-1759 (1985) or Howley et al, U. S. Patent 

4,419,446. 

The host cells transformed with the one or 
more vectors carrying the PACE DNA and the selected 
precursor DNA are selected, e.g. by conventional means, 

15 and may then be cultured under suitable conditions if 

desired, with amplification of one or both introduced 
genes. The method of this present invention therefore 
comprises culturing a suitable cell or cell line, which 
has been transformed with a DNA sequence coding for PACE 

20 and a DNA sequence coding for the selected precursor, 

each coding sequence under the control of a 
transcriptional regulatory sequence- The expressed 
mature protein is then recovered, isolated and purified 
from the culture medium (or from the cell, if expressed 

25 intracellularly) by appropriate means known to one of 

skill in the art. 

With respect to 7-carboxylated proteins, it 
is presently and theoretically contemplated that the 
expression of PACE in mammalian cells increases the 

3 0 efficiency of 7-carboxylation, a post-trans lational 

modification required for biological activity of certain 
mature proteins. The method is especially useful in the 
processing of vitamin K-dependent blood coagulation 
proteins- More specifically the method is useful in 

35 processing and 7-carboxylating other proteins including 
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Protein C, Protein S, Prothrombin Factor IX, Factor VII, 
Factor X and bone 7-carboxyglutamate protein. For 
example, co-expression with PACE with such a propeptide 
permits high level recombinant expression of biologically 
5 active mature proteins. 

In addition, high levels of recombinant 
expression of functional proteins can also be achieved by 
use of the present method by expressing PACE with more 
completely processed proteins expressed from other genes. 

10 For example, coexpression of PACE with non-Vitamin K 

dependent propeptides which require cleavage, but not 7- 
carboxylation, for biological activity may produce high 
yields of functional mature proteins. 

One such protein which may be expressed in 

15 high functional yields by the present method is bone 

morphogenic protein (BMP), particularly BMP-2 [see, e.g., 
E. Wang et al, Proc. Natl. Acad> Sci. USA . 87 ; 2220-2224 
(1990) , which is incorporated by reference herein for 
information about that protein] • Other such proteins 

2 0 which may be produced in high functional yields by the 

present invention include tumor growth factor )8 (TGF-i8) 
and platelet-derived growth factor (PDGF) and the 
precursors identified specifically above. 

Further, the present invention also 
25 encompasses the use of recoitibinant-derived PACE for in 

vitro processing of nerve growth factor and monobasic 
propiomelanocortin. PACE may also be useful in the 
processing of proteins, such as insulin, and for the 
maturation of viruses, such as HIV and Hepatitis C, which 

3 0 also require precursor processing at paired basic amino 

acid residues. 

While mammalian cells are preferred as hosts 
for the co-expression of PACE and a mammalian proprotein, 
it is anticipated that microorganism and insect cells may 
35 be suitable hosts for such expression of PACE and 
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mammalian proproteins, as well as expression, where 
desired of proproteins of microbial or insect origin, 

p. Expression of PACE in Microor gapism Cells 
5 The PACE gene or a fragment thereof can be 

expressed in a exi3caryotic or prokaryotic microorganism 
system, such as fungi, including yeast, or bacteria. 
Fragments can include truncated forms of the PACE gene. 
Examples of truncation include, but are not limited to, 
10 deletion of the transmembrane region and/ or the cysteine- 

rich region. 

Fungal expression systems can utilize both 
yeast and filamentous fungi hosts. Examples of 
filamentous fungi expression systems are Aspergillus, as 

15 described in EPO Pub. No. 357 127 (published March 7, 

1990) , and Acremonium chrvsoaenum , described in EPO Pub. 
No. 376 266 (published July 4, 1990). 

A yeast expression system can typically 
include one or more of the following: a promoter 

20 sequence, fusion partner sequence, leader sequence, 

transcription termination sequence. These elements can 
be combined into an expression cassette, which may be 
maintained in a replicon, preferably with a selectable 
marker . 

25 A yeast promoter is any DNA sequence capable 

of binding yeast RNA polymerase and initiating the 
downstream (3') transcription of a coding sequence (e.g. 
structural gene) into mRNA. A promoter will have a 
transcription initiation region which is usually placed 

30 proximal to the 5' end of the coding sequence. This 

transcription initiation region typically includes an RNA 
polymerase binding site (the "TATA Box") and a 
transcription initiation site. A yeast promoter may also 
have a second domain called an upstream activator 

35 sequence (UAS) , which, if present, is usually distal to 
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the structural gene. The UAS permits regulated 
(inducible) expression. Constitutive expression occurs 
in the absence of a UAS. Regulated expression may be 
either positive or negative, thereby either enhancing or 
reducing transcription. 

Yeast is a fermenting organism with an 
active metabolic pathway, therefore sequences encoding 
enzymes in the metabolic pathway provide particularly 
useful promoter sequences. Examples include alcohol 
dehydrogenase (ADH) (E.P.O. Pub. No. 284044), enolase, 
glucokinase, glucose-6-phosphate isomerase, 
glyceraldehyde-3-phosphate-dehydrogenase (GAP or GAPDH) , 
hexokinase , phosphof ructokinase , 3 -phosphoglycerate 
mutase, and pyruvate kinase (PyK) (E.P.O. Pub. No. 
329203) . The yeast PH05 gene, encoding acid phosphatase, 
also provides useful promoter sequences [Myanohara et 
al., Proc. Natl. Acad. Sci. USA . 80:1 (1983)]. 

In addition, synthetic promoters which do 
not occur in nature also function as yeast promoters. 
For example, UAS sequences of one yeast promoter may be 
joined with the transcription activation region of 
another yeast promoter, creating a synthetic hybrid 
promoter. Examples of such hybrid promoters include the 
ADH regulatory sequence linked to the GAP transcription 
activation region [U.S. Patent Nos. 4,876,197 and 
4,880,734]. Other examples of hybrid promoters include 
promoters which consist of the regulatory sequences of 
either the ADH2, GAL4 . GALIO, or PH05 genes, combined 
with the transcriptional activation region of a 
glycolytic enzyme gene such as GAP or PyK [E.P.O. Pub. 
No. 164556]. Furthermore, a yeast promoter can include 
naturally occurring promoters of non-yeast origin that 
have the ability to bind yeast RNA polymerase and 
initiate transcription. Examples of such promoters 
include, inter alia . [Cohen et al., Proc. Natl. Acad. 
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5ci. USA , 77:1078 (1980); Henikoff et al., yJature 283:835 
(1981); Hollenberg et al., curr. Topics Microbiol > 
Immunol, . 96:119 (1981); Hollenberg et al., "The 
Expression of Bacterial Antibiotic Resistance Genes in 
5 the Yeast Saccharomyces cerevisiae," in: P3.asmids of 

Medical. Environmental and Comme rcial Importance (eds. 
K.N. Timmis and A. Puhler, 1979); Mercerau-Puigalon et 
al.. Gene. 11:163 (1980); and Panthier et al., Curr. 
Genet. . 2:109 (1980)]. 

10 The PACE gene or a fragment thereof may be 

expressed intracellular ly in yeast. A promoter sequence 
may be directly linked with the PACE gene or fragment, in 
which case the first amino acid at the N-terminus of the 
recombinant protein will always be a methionine, which is 

15 encoded by the ATG start codon. If desired, methionine 

at the N-terminus may be cleaved from the protein by in 
vitro incubation with cyanogen bromide. 

Intracellularly expressed fusion proteins 
provide an alternative to direct expression of the PACE 

20 gene or fragment. Typically, a DNA sequence encoding the 

N-terminal portion of a stable protein, a fusion partner, 
is fused to the 5« end of heterologous DNA encoding the 
desired polypeptide. Upon expression, this construct 
will provide a fusion of the two amino acid sequences. 

25 For example, the yeast or human superoxide dismutase 

(SOD) gene, can be linked at the 5" terminus of the PACE 
gene or fragment thereof and expressed in yeast. The DNA 
sequence at the junction of the two amino acid sequences 
may or may not encode a cleavable site. See, e.g., 

30 E.P.O. Pub. No. 196056. Another example is a ubiquitin 

fusion protein. Such a ubiquitin fusion protein 
preferably retains a site for a processing enzyme (e.g. 
ubiquitin-specif ic processing protease) to cleave the 
ubiquitin from the PACE polypeptide. Through this 
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method, therefore, a mature PACE polypeptide can be 
isolated [see, P.C.T, WO 88/024066]. 

Alternatively, PACE polypeptides can also be 
secreted from the cell into the growth media by creating 
chimeric DNA molecules that encode a fusion protein 
comprised of a leader sequence fragment that provides for 
secretion in yeast of the PACE polypeptides. Preferably, 
there are processing sites encoded between the leader 
fragment and the PACE gene or fragment thereof that can 
be cleaved either in vivo or in vitro . The leader 
sequence fragment typically encodes a signal peptide 
comprised of hydrophobic amino acids which direct the 
secretion of the protein from the cell. 

DNA encoding suitable signal sequences can 
be derived from genes for secreted yeast proteins, such 
as the yeast invertase gene [E.P.O. Pub. No. 12873; 
J. P.O. Pub. No. 62,096,086] and the A-factor gene [U.S. 
Patent No. 4,588,684]. Alternatively, leaders of non- 
yeast origin, such as an interferon leader, exist that 
also provide for secretion in yeast [E.P.O. Pub. No. 
60057] . 

A preferred class of secretion leaders are 
those that employ a fragment of the yeast alpha-factor 
gene, which contains both a "pre" signal sequence, and a 
"pro" region. The types of alpha-factor fragments that 
can be employed include the full-length pre-pro alpha 
factor leader (about 83 amino acid residues) as well as 
truncated alpha-factor leaders (typically about 25 to 
about 50 amino acid residues) [U.S. Patent Nos. 4,546,083 
and 4,870,008; and E.P.O. Pub. No. 324274]. Additional 
leaders employing an alpha-factor leader fragment that 
provides for secretion include hybrid alpha-factor 
leaders made with a presequence of a first yeast, but a 
pro-region from a second yeast alphaf actor. See, e.g., 
P.C.T. WO 89/02463. 
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Typically, transcription termination 
sequences recognized by yeast are regulatory regions 
located 3' to the translation stop codon and thus, 
together with the promoter, flank the coding sequence. 
These sequences direct the transcription of an mRNA which 
cam be translated into the polypeptide encoded by the 
DNA. Examples of transcription terminator sequence and 
other yeast-recognized termination sequences, such as 
those coding for glycolytic enzymes, are known to those 
of skill in the art. 

Typically, the above described components, 
comprising a promoter, leader (if desired) , coding 
sequence of interest, and transcription termination 
sequence, are put together into expression constructs. 
Expression constructs or cassettes are often maintained 
in a replicon, such as an extrachromosomal element (e.g., 
plasmids) capable of stable maintenance in a host, such 
as yeast or bacteria. The replicon may have two 
replication systems, thus allowing it to be maintained, 
for example, in yeast for expression and in a procaryotic 
host for cloning and amplification. Examples of such 
yeast-bacteria shuttle vectors include YEp24 [Botstein et 
al.. Gene. 8:17-24 (1979)], pCl/1 [Brake et al., Proc. 
Natl, Acad, Sci USA , 81:4642-4646 (1984)], and YRpl7 
[Stinchcomb et al., J- Mol. Biol. . 158:157 (1982)]. In 
addition, a replicon may be either a high or low copy 
number plasmid. A high copy number plasmid will 
generally have a copy number ranging from about 5 to 
about 200, and typically about 10 to about 150. A host 
containing a high copy number plasmid will preferably 
have at least about 10, and more preferably at least 
about 20. Enter a high or low copy number vector may be 
selected, depending upon the effect on the host of the 
vector and the PACE polypeptides. See e.g., Brake et 
al. , supra . 
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Alternatively, the expression constructs can 
be integrated into the yeast genome with an integrating 
vector. Integrating vectors typically contain at least 
one sequence homologous to a yeast chromosome that allows 
the vector to integrate, and preferably contain two 
homologous sequences flanking the expression construct. 
Integrations appear to result from recombinations between 
homologous DNA in the vector and the yeast chromosome 
[Orr-Weaver et al., Methods in Enzymol. . 101 :228-245 
(1983)]. An integrating vector may be directed to a 
specific locus in yeast by selecting the appropriate 
homologous sequence for inclusion in the vector. See 
Orr-Weaver et al., supra . One or more expression 
constructs may integrate, possibly affecting levels of 
recombinant protein produced [Rine et al., Proc. Natl. 
Acad. Sci. USA . 80:6750 (1983)]. The chromosomal 
sequences included in the vector can occur either as a 
single segment in the vector, which results in the 
integration of the entire vector, or as two segments 
homologous to adjacent segments in the chromosome and 
flanking the expression construct in the vector, which 
results in the stable integration of only the expression 
construct. 

Typically, extrachromosomal and integrating 
expression vectors may contain selectable markers to 
allow for the selection of yeast strains that have been 
transformed. Selectable markers may include biosynthetic 
genes that can be expressed in the yeast host, such as 
ADE2, HIS4i LEU2, TRPl, and ALG7, and the G418 resistance 
gene, which confer resistance in yeast cells to 
tunicamycin and G418, respectively. In addition, a 
suitable selectable marker may also provide yeast with 
the ability to grow in the presence of toxic compounds, 
such as metal. For example, the presence of CUPl allows 
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yeast to grow in the presence of copper ions [Butt et 
al., fTTcrobiol. Rev. > 11:351 (1987)]. 

Alternatively, some of the above described 
components can be put together into transformation 
vectors. Transformation vectors are typically made up of 
a selectable marker that is either maintained in a 
replicon or developed into an integrating vector, as 
described above. 

Expression and transformation vectors, 
either extrachromosomal replicons or integrating vectors, 
have been developed for transformation into many yeasts. 
For example, expression vectors have been developed for, 
inter alia, the following yeasts: Candida albicans 
[Kurtz, et al., Mol. Cell. Biol. , 6:142 (1986)], Candida 
maltosa [Kunze et al. , J. Basic Microbiol., 25:141 
(1985)]; Hansenula polymorpha [Gleeson et al., J. Gen. 
Microbiol . 132 :3459 (1986); Roggenkamp et al., Mol. Gen. 
Genet. 202 :302 (1986)]; Kluyveromyces fragilis [Das et 
al., J. Bacterid. 158 :1165 (1984)]; Kluyveromyces lactis 
[De Louvencourt et al., J- Bacterid. 154:737 (1983); Van 
den Berg et al., Bio/Technolocrv 8:135 (1990)]; Pichia 
guillerimondii [Kunze et al., J. Basic Microbiol. 25:141 
(1985)]; Pichia pastoris [Cregg et al., Mol. Cell. Biol. 
5:3376 (1985); U.S. Patent Nos. 4,837,148 and 4,929,555]; 
Sacchciromyces cerevisiae [Hinnen et al., Proc. NatJ.. 
Acad. Sci. USA 75:1929 (1978); Ito et al. , J. Bacterid. 
153 : 163 (1983)]; Schizosaccharomyces pombe [Beach and 
Nurse, Nature 300 :706 (1981)]; and Yarrowia lipolytica 
[Davidow, et al. , Curr. Genet. 10:380471 (1985); and 
Gaillardin et al., Curr. Genet. 10:49 (1985)]. 

Methods of introducing exogenous DNA into 
yeast hosts are well-known in the art, and typically 
include either the transformation of spheroplasts or of 
intact yeast cells treated with alkali cations. 
Transformation procedures usually vary with the yeast 
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species to be transformed. See e.g., Kurtz et al., Mol . 
Cell. Biol. 6:142 (1986); Kunze et al., J. Basic 
Microbiol. 25:141 (1985) for Candida. See, e.g., Gleeson 
et al., J. Gen. Microbiol. 132 ; 3459 (1986); Roggenkamp et 
al., Mol. Gen. Genet. 2 02 :302 (1986) for Hansenula. See, 
e.g., Das et al., J. Bacteriol. 158 :1165 (1984); De 
Louvencourt et al., J. Bacteriol. 154 ; 1165 (1983); Van 
den Berg et al.. Bio /Technology 8:135 (1990) for 
Kluyveromyces . See, e.g., Cregg et al., Mol. Cell. Biol. 
5:3376 (1985); Kunze et al., J. Basic Microbiol. 25:141 
(1985); U.S. Patent Nos. 4,837,148 and 4,929,555 for 
Pichia. See, e.g., Hinnen et al., Proc. Natl. Acad. Sci. 
USA 7511929 (1978); Ito et al. , J. Bacteriol. 153 :163 
(1983) for Saccharomyces. See, e.g., Beach and Nurse, 
Nature 300 ; 706 (1981) for Schizosaccharomyces. See, 
e.g., Davidow et al., Curr. Genet. 10:39 (1985); 
Gaillardin et al., Curr . Genet . 10:49 (1985) for 
Yarrowia. 

Additionally, the PACE gene or a fragment 
thereof can be expressed in a bacterial system. Therein, 
a bacterial promoter is any DNA sequence capable of 
binding bacterial RNA polymerase and initiating the 
downstream (3") transcription of a coding sequence (e.g. 
structural gene) into mRNA. A promoter will have a 
transcription initiation region which is usually placed 
proximal to the 5' end of the coding sequence. This 
transcription initiation region typically includes an RNA 
polymerase binding site and a transcription initiation 
site. A bacterial promoter may also have a second domain 
called an operator, that may overlap an adjacent RNA 
polymerase binding site at which RNA synthesis begins. 
The operator permits negative regulated (inducible) 
transcription, as a gene repressor protein may bind the 
operator and thereby inhibit transcription of a specific 
gene. Constitutive expression may occur in the absence 
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of negative regulatory elements, such as the operator. 
In addition, positive regulation may be achieved by a 
gene activator protein binding sequence, which, if 
present is usually proximal (5") to the RNA polymerase 
5 binding sequence. An example of a gene activator protein 

is the catabolite activator protein (CAP) , which helps 
initiate transcription of the lac operon in Ppcherj-chia 
coli [Raibaud et al., Annu. Rev , Genet, 18:173 (1984)]. 
Regulated expression may therefore be either positive or 

10 negative, thereby either enhancing or reducing 

transcription . 

Sequences encoding metabolic pathway enzymes 
provide particularly useful promoter sequences. Examples 
include promoter sequences derived from sugar 

15 metabolizing enzymes, such as galactose, lactose (lac) 

[Chang et al. , Nature 1£8:1056 (1987)], and maltose. 
Additional examples include promoter sequences derived 
from biosynthetic enzymes such as tryptophan (trp) 
[Goeddel et al., Nuc. Acids Res. 8:4057 (1980); Yelverton 

20 et al., Kticl. Acids Res. 9:731 (1981); U.S. Patent 

No. 4,738,921; E.P.O. Pub. Nos. 36,776 and 121,775]. The 
)S-lactomase ( bla ) promoter system [Weissmann, "The 
Cloning of Interferon and Other Mistakes" in Interferon 3 
(ed. I. Gresser, 1981)]; bacteriophage lambda PL 

25 [Shimatake et al.. Nature 292:128 (1981)] and T5 [U.S. 

Patent No. 4,689,406] promoter systems also provide 
useful promoter sequences. 

In addition, synthetic promoters which do 
not occur in nature also function as bacterial promoters. 

3 0 For example, transcription activation sequences of one 

bacterial or bacteriophage promoter may be joined with 
the operon sequences of another bacterial or 
bacteriophage promoter, creating a synthetic hybrid 
promoter [U.S. Patent No. 4,551,433]. For example, the 

3 5 tac promoter is a hybrid trp-iac promoter comprised of 
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both trp promoter and lac operon sequences that is 
regulated by the lac repressor [Amann et al., Gene 25:167 
(1983); de Boer et al,, Proc. Natl. Acad. Sci. 80^21 
(1983)]. Furthermore, a bacterial promoter can include 
5 naturally occurring promoters of non-bacterial origin 

that have the ability to bind bacterial RNA polymerase 
and initiate transcription. A naturally occurring 
promoter of non-bacterial origin can also be coupled with 
a compatible RNA polymerase to produce high levels of 

10 expression of some genes in prokaryotes . The 

bacteriophage T7 RNA polymerase /promoter system is an 
example of a coupled promoter system [Studier et al., 
Mol. Biol. 189:113 (1986); Tabor et al., Proc Nat l. Acad. 
Sci. 82:1074 (1985)]. In addition, a hybrid promoter can 

15 also be comprised of a bacteriophage promoter and an 

coli operator region [E.P.O. Pub. No. 267,851]. 

In addition to a functioning promoter 
sequence, an efficient ribosome binding site is also 
useful for the expression of the PACE gene or fragment 

20 thereof in prokaryotes. In E. coli , the ribosome binding 

site is called the Shine-Dalgarno (SD) sequence and 
includes an initiation codon (ATG) and a sequence 3-9 
nucleotides in length located 3-11 nucleotides upstream 
of the initiation codon [Shine et al., Nature 254 ; 34 

25 (1975)]. The SD sequence is thought to promote binding 

of mRNA to the ribosome by the pairing of bases between 
the SD sequence and the 3' and of E. coli 16S rRNA 
[Steitz et al., "Genetic signals and nucleotide sequences 
in messenger RNA" in Biological Regulation and 

3 0 Development: Gene Expression (ed. R.F. Goldberger, 

1979)]. To express eukaryotic genes and prokaryotic 
genes with weak ribosome-binding site [Sambrook et al., 
"Expression of cloned genes in Escherichia coli" in 
Molecular Cloning; A Laboratory Manual , cited above] . 
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PACE may be expressed intracellularly . A 
promoter sequence may be directly linked with the PACE 
gene or a fragment thereof, in which case the first amino 
acid at the N-terminus will always be a methionine, which 
is encoded by the ATG start codon. If desired, 
methionine at the N-terminus may be cleaved from the 
protein by jji vitro incubation with cyanogen bromide or 
by either in vivo on in vitro incubation with a bacterial 
methionine N-terminal peptidase [E.P.O. Pub. No. 
219,237] . 

Fusion proteins provide an alternative to 
direct expression- Typically, a DNA sequence encoding 
the N-terminal portion of an endogenous bacterial 
protein, or other stable protein, is fused to the 5' end 
of heterologous PACE coding sequences. Upon expression, 
this construct will provide a fusion of the two amino 
acid sequences. For example, the bacteriophage lamtbda 
cell gene can be linked at the 5' terminus of the PACE 
gene or fragment thereof and expressed in bacteria. The 
resulting fusion protein preferably retains a site for a 
processing enzyme (factor Xa) to cleave the bacteriophage 
protein from the PACE gene or fragment thereof [Nagai et 
al. , Nature 309:810 (1984)]. 

Fusion proteins can also be made with 
sequences from the lac Z [Jia et al.. Gene 60:197 (1987)], 
trpE [Allen et al. , J. Biotechnol. . 5:93 (1987); Makoff 
et al., J. Gen. Microbiol. 135:11 (1989), and Chey 
[E.P.O. Pub. No. 324,647] genes. The DNA sequence at the 
junction of the two amino acid sequences may or may not 
encode a cleavable site. Another example is a ubiquitin 
fusion protein. Such a fusion protein is made with the 
\ibiquitin region that preferably retains a site for a 
processing enzyme (e.g. ubiquitin specific processing- 
protease) to cleave the ubiquitin from the PACE 
polypeptide. Through this method, mature PACE 
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polypeptides can be isolated [Miller et al., 
Bio /Technology . 7:698 (1989)]. 

Alternatively, PACE polypeptides can also be 
secreted from the cell by creating chimeric DNA molecules 
5 that encode a fusion protein comprised of a signal 

peptide sequence fragment that provides for secretion of 
the PACE polypeptides in bacteria [U.S. Patent No. 
4,336,336]. The signal sequence fragment typically 
encodes a signal peptide comprised of hydrophobic amino 

10 acids which direct the secretion of the protein from the 

cell. The protein is either secreted into the growth 
media (Gram-positive bacteria) or into the periplasmic 
space, located between the inner and outer membrane of 
the cell (gram-negative bacteria) . Preferably there are 

15 processing sites, which can be cleaved either in vivo or 

in vitro , encoded between the signal peptide fragment and 
the PACE polypeptide. 

DNA encoding suitable signal sequences can 
be derived from genes for secreted bacterial proteins, 

2 0 such as the E. coli outer membrane protein gene f ompA ) 

[Masui et al., in Experimental Manipulation of Gene 
Expression (1983); Ghrayeb et al., EMBO J. 3:2437 (1984)] 
and the E. coli alkaline phosphatase signal sequence 
(EhoA) [Oka et al., Proc. Natl. Acad. Sci. 82 : 7212 
25 (1985)]. As an additional example, the signal sequence 

of the alpha-amylase gene from various Bacillus strains 
can be used to secrete heterologous proteins from B. 
subtilis [Palva et al., Proc. Natl. Acad. Sci. USA 
79:5582 (1982); E.P.O. Pub. No. 244,042]. 

3 0 Typically, transcription termination 

sequences recognized by bacteria are regulatory regions 
located 3' to the translation stop codon and thus, 
together with the promoter, flank the coding sequence. 
These sequences direct the transcription of an mRNA which 
35 can be translated into the polypeptide encoded by the 
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DNA. Transcription termination sequences frequently 
include DNA sequences (of about 50 nucleotides) which are 
capable of forming stem loop structures that aid in 
terminating transcription. Examples include 
5 transcription termination sequences derived from genes 

with strong promoters, such as the trp gene in £. cgli as 
well as other biosynthetic genes. 

Typically, the above described components, 
comprising a promoter, signal sequence (if desired) , 

10 coding sequence of interest, and transcription 

termination sequence, are put together into expression 
constructs. Expression constructs are often maintained 
in a replicon, such as an extrachromosomal element (e.g., 
plasmids) capable of stable maintenance in a host, such 

15 as bacteria. The description of similar replicon 

systems, including copy number parameters are described 
in detail above in connection with yeast expression 
systems. Such description is also applicable to 
bacterial systems. 

20 Alternatively, the expression constructs can 

be integrated into the bacterial genome with an 
integrating vector. Integrating vectors typically 
contain at least one sequence homologous to the bacterial 
chromosome that allows the vector to integrate. 

25 Integrations appear to result from recombinations between 

homologous DNA in the vector and the bacterial 
chromosome. For example, integrating vectors constructed 
with DNA from various Bacillus strains integrate into the 
Bacillus chromosome [E.P.O. Pub. No. 127,328]. 

3 0 Integrating vectors may also be comprised of 

bacteriophage or transposon sequences. 

Typically, extrachromosomal and integrating 
expression constructs may contain selectable markers to 
allow for the selection of bacterial strains that have 

35 been transformed. Selectable markers can be expressed in 
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the bacterial host and may include genes which render 
bacteria resistant to drugs such as ampicillin, 
chloramphenicol, erythromycin, kanamycin (neomycin) , and 
tetracycline [Davies et al., Annu . Rev . Microbiol , 32:469 
(1978)]. Selectable markers may also include 
biosynthetic genes, such as those in the histidine, 
tryptophan, and leucine biosynthetic pathways. 

Alternatively, some of the above described 
components can be put together in transformation vectors. 
Transformation vectors are typically comprised of a 
selectable market that is either maintained in a replicon 
or developed into an integrating vector, as described 
above . 

Expression and transformation vectors, 
either extra-chromosomal replicons or integrating 
vectors, have been developed for transformation into many 
bacteria. For example, expression vectors have been 
developed for the following bacteria: Bacillus subtilis 
[Palva et al., Proc. Natl. Acad. Sci> USA 79:5582 (1982); 
E.P.O. Pub. Nos. 36,259 and 63,953; P.C.T. WO 84/04541]; 
E. coli [Shimatake et al.. Nature . 292 : 128 (1981); Amann 
et al.. Gene , 40:183 (1985); Studier et al., J. Mol. 
Biol. 189 :113 (1986); E.P.O. Pub. Nos. 36,776, 136,829 
and 136,907; U.K. Patent Application Serial No. 8418273]; 
Streptococcus cremoris [Powell et al., Appl. Environ. 
Microbiol. 54:655 (1988)]; Streptococcus lividans [Powell 
et al., Appl. Environ. Kicrobiol. 54:655 (1988)]; 
Streptomvces lividans [U.S. Patent No. 4,745,056]. 

Methods of introducing exogenous DNA into 
bacterial hosts are well-known in the art, and typically 
include either the transformation of bacteria treated 
with CaCl^ or other agents, such as divalent cations and 
DMSO. DNA can also be introduced into bacterial cells by 
electroporation. Transformation procedures usually vary 
with the bacterial species to be transformed. See e.g.. 
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[Masson et al. , fkms Micro biol. Lett. €0:273 (1989); 
Palva et al., PT-nr.. Natl. Acad. Sci. USA 72:5582 (1982); 
E.P.O. Pub. NOS. 36,259 and 63,953; P.C.T. WO 84/04541, 
Bacillus], [Miller et al., Proc. Nat;, ftgad. Scj,. 855856 
5 (1988); Wang et al. , Bacteriol. 172:949 (1990) for 

Campylobacter]; [Cohen et al., Proc. Natl. Acad. Sci. 
69:2110 (1973); Dower et al.. Nucleic Acids Res. 16:6127 
(1988); Kushner, "An improved method for transformation 
of Escherichia coli with ColEl-derived plasmids" in 

10 Genetic Fnaineerjnar Proc eedings of the International 

symposium on Genetic Engineering (eds. H.W. Boyer and S. 
Nicosia, 1978); Mandel et al. , J. Mol. Biol. 53:159 
(1970); Taketo, Rioehim. p -iophvs. Acta 949:318 (1988) for 
Escherichia], [Chassy et al., fems Microbiol. Lett. 

15 44:173 (1987) for Lactobacillus]; [Fiedler et al. , Anal. 

Biochem 170 :38 (1988) for Pseudomonas] ; [Augustin et al., 
FEMS Microbiol. Lett. 66:203 (1990) for Staphylococcus ] ; 
[Barany et al., .t. Bacteriol. 144:698 (1980); Harlander, 
"Transformation of Streptococcus lactis by 

20 electroporation, " in Streptoc occal Genetics (ed. J. 

Ferretti and R. Curtiss III, 1987); Perry et al., Infec. 
Immun. 32:1295 (1981); Powell et al., Appl. Environ. 
Microbiol . 54:655 (1988); Somkuti et al., Proc. 4th Evr. 
Cong. Biotechnoloov 1:412 (1987) for Streptococcus] . 

25 

C. Expression in insect Cells 

In one aspect of the invention, enhanced 
processing of a precursor polypeptide to a mature 
polypeptide is achieved by introducing into an insect 
30 host cell DNA sequences coding for PACE, yielding a 

recombinant insect cell. The precursor polypeptide and 
PACE are related in that the precursor has at least one 
selectively cleavable peptide bond, which is cleavable by 
PACE. The transcriptional initiation and expression of 
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PACE allows for an enhanced production of PACE as 
compared to the unmodified host. 

The polynucleotide encoding PACE is inserted 
into a suitable insect expression vector, and is operably 
linked to the control elements within that vector. 
Vector construction employs techniques which are known in 
the art. Various constructs can be prepared once the 
desired PACE DNA sequence is obtained. 

Generally, the components of the expression 
system include a transfer vector, usually a bacterial 
plasmid, which contains both a fragment of the 
baculovirus genome, and a convenient restriction site for 
insertion of the heterologous gene or genes to be 
expressed; a wild type baculovirus with a sequence 
homologous to the baculovirus-specif ic fragment in the 
transfer vector, which allows for the homologous 
recombination of the heterologous gene into the 
baculovirus genome, and appropriate insect host cells and 
growth media. 

After inserting the PACE DNA sequence into 
the transfer vector, the vector and the wild type viral 
genome are transfected into an insect host cell where the 
vector and viral genome are allowed to recombine. The 
packaged recombinant virus is expressed and recombinant 
plaques are identified and purified. Materials and 
methods for baculovirus/ insect cell expression systems 
are commercially available in kit form from, inter alia . 
Invitrogen, San Diego CA ("MaxBac" kit) . These 
techniques are generally known to those skilled in the 
art and fully described in Summers and Smith, Texas 
Agricultural Experiment Station Bulletin No. 1555 (1987) 
(hereinafter "Summers and Smith") , and incorporated by 
reference. 

Prior to inserting the PACE DNA sequence 
into the baculovirus genome, the above described 
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components, comprising a promoter, leader (if desired) , 
coding sequence of interest, and transcription 
termination sequence, are typically assembled into an 
intermediate transplacement construct (transfer vector) • 
This construct may contain a single gene and operably 
linked regulatory elements; multiple genes, each with its 
owned set of operably linked regulatory elements; or 
multiple genes, regulated by the same set of regulatory 
elements. Intermediate transplacement constructs are 
often maintained in a replicon, such as an 
extrachromosomal element (e.g., plasmids) capable of 
stable maintenance in a host, such as a bacterium. The 
replicon will have a replication system, thus allowing it 
to be maintained in a suitable host for cloning and 
amplif ication . 

Currently, the most commonly used transfer 
vector for introducing foreign genes into AcNPV is 
pAc373. Many other vectors, known to those of skill in 
the art, have also been designed. These include, for 
example, pVL985 (which alters the polyhedrin start codon 
from ATG to ATT, and which introduces a BamHI cloning 
site 32 basepairs downstream from the ATT [see, e.g., 
Luckow and Summers, Virology , 17:31 (1989)]. 

The plasmid usually also contains the 
polyhedrin polyadenylation signal [Miller et al. , Ann. 
Rev. Microbiol. . 12:177 (1988) and a prokaryotic 
ampicillin-resistance ( amp ) gene and origin of 
replication for selection and propagation in Ej_ coli. 

Baculovirus transfer vectors usually contain 
a baculovirus promoter. A baculovirus promoter is any 
DNA sequence capable of binding a baculovirus RNA 
polymerase and initiating the downstream (5' to 3') 
transcription of a coding sequence (e.g. structural gene) 
into mRNA. A promoter will have a transcription 
initiation region which is usually placed proximal to the 
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5' end of the coding sequence. This transcription 
initiation region typically includes an RNA polymerase 
binding site and a transcription initiation site. A 
baculovirus transfer vector may also have a second domain 
5 called an enhancer, which, if present, is usually distal 

to the structural gene. Expression may be either 
regulated or constitutive. 

Structural genes, abundantly transcribed at 
late times in a viral infection cycle, provide 

10 particularly useful promoter sequences. Examples include 

sequences derived from the gene encoding the viral 
polyhedron protein [Friesen et al., "The Regulation of 
Baculovirus Gene Expression," in The Molecular Biology of 
Baculoviruses (ed. Walter Doerfler, 1986); E.P.O. Pub. 

15 Nos. 127,839 and 155,476]; and the gene encoding the plO 

protein [Vlak et al., J. Gen. Virol. 69:765 (1988)]. 

DNA encoding suitable signal sequences can 
be derived from genes for secreted insect or baculovirus 
proteins, such as the baculovirus polyhedrin gene 

20 [Carbonell et al., Gene, 73.:409 (1988)]. Alternatively, 

since the signals for mammalian cell posttranslational 
modifications (such as signal peptide cleavage, 
proteolytic cleavage, and phosphorylation) appear to be 
recognized by insect cells, and the signals required for 

25 secretion and nuclear accumulation also appear to be 

conserved between the invertebrate cells and vertebrate 
cells, leaders of non-insect origin, such as those 
derived from genes encoding human a-interferon [Maeda et 
al.. Nature 315 :592 (1985)]; human gastrin-releasing 

30 peptide [Lebacq-Verheyden et al., Molec. Cell. Biol, 

8:3129 (1988)]; human IL-2 [Smith et al., Proc. Nat'l 
Acad. Sci. USA , 82:8404 (1985)]; mouse IL-3 [Miyajima et 
al.. Gene . 58:273 (1987); and human glucocerebrosidase 
[Martin et al., DNA . 7:99 (1988)] can also be used to 

35 provide for secretion in insects. 
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In some instances, as described above, it 
may be desirable to have a plurality of copies, two or 
more, of the gene expressing the expression product 
precursor in relation to the PACE DNA sequence or vice 
versa. Some of the embodiments of the present invention 
include recombinant production of multiple proteins, for 
instance PACE and one or several heterologous precursor 
polypeptides. This may be accomplished by several 
different strategies. For example, PACE may be produced 
by expression of a gene encoding PACE in the 
baculovirus/ insect cell expression system described 
herein, PACE so produced may then be used to cleave 
enzymatically a heterologous precursor polypeptide, 
thereby generating a more mature form of the protein. Of 
course, both PACE and the precursor polypeptide may be 
produced by independent baculovirus/insect cell 
expression systems and subsequently admixed. 

Alternatively, PACE and one or more 
preciirsor polypeptides may be simultaneously produced by 
expression of the corresponding genes in the same insect 
cell. Each gene may be introduced into the insect cell 
by a separate transformation event, for instance separate 
transfections, transfection and baculovirus infection, or 
multiple baculovirus infections. Various combinations 
will be apparent to those skilled in the art. Transfer 
vectors can also be constructed which have two or more 
sets of operably linked expression regulating elements 
described above. Each set of expression elements has a 
unique restriction site into which a different gene may 
be inserted. Each set of elements may use the same type 
of promoter, or a different promoter may be used for each 
set. The enzyme /substrate ratio of PACE and precursor 
polypeptides may be optimized by use of different 
promoters with varying relative efficiencies. 
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Finally, a transfer vector incorporating 
multiple genes encoding PACE and one or more precursor 
polypeptides may be designed such that all genes are 
expressed as a polycistronic message under the control of 
5 a single set of regulatory elements. The resulting 

polyprotein can be processed into component parts by the 
autocatalytic activity of the PACE moiety, or by the 
incorporation of recognition sites for a site specific 
endopeptidase, such as signal peptidase, between 

10 functional domains. 

A recombinant polypeptide or polyprotein may 
be expressed intracellular ly or, if it is expressed with 
the proper regulatory sequences, it can be secreted. 
Good intracellular expression of nonfused foreign 

15 proteins usually requires heterologous genes that ideally 

have a short leader sequence containing suitable 
translation initiation signals preceding an ATG start 
signal. If desired, methionine at the N-terminus may be 
cleaved from the mature protein by in vitro incubation 

20 with cyanogen bromide. 

Alternatively, recombinant polyproteins or 
proteins which are not naturally secreted can be secreted 
from the insect cell by creating chimeric DNA molecules 
that encode a fusion protein comprised of a leader 

25 sequence fragment that provides for secretion of the 

heterologous protein from insect cells. The leader 
sequence fragment typically encodes a signal peptide 
comprised of hydrophobic amino acids which direct the 
translocation of the protein into the endoplasmic 

30 reticulum. 

After insertion of the PACE DNA sequence 
and/ or the gene encoding the expression product 
precursor, an insect cell host is co-transformed with the 
heterologous DNA of the transfer vector and the genomic 
35 DNA of wild type baculovirus — usually by co- 
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transfection. The promoter and transcription termination 
sequence of the construct will typically comprise a 2-5 
kb section of the baculovirus genome. Methods for 
introducing heterologous DNA into the desired site in the 
5 baculovirus virus are known in the art [see, e.g., 

Summers and Smith, cited above; Ju et al. (1987) cited 
above; Smith et al., Mnl , Cell. Biol.. 3:2156 (1983); and 
Luckow and Summers (1989) cited above]. For example, the 
insertion can be into a gene such as the polyhedrin gene, 

10 by homologous double crossover recombination; insertion 

can also be into a restriction enzyme site engineered 
into the desired baculovirus gene [Miller et al., 
Bioessavs . 1:91 (1989)]. The DNA sequence, when cloned 
in place of the polyhedrin gene in the expression vector, 

15 is flanked both 5' and 3' by polyhedrin-specif ic 

sequences and is positioned downstream of the polyhedrin 
promoter . 

The newly formed baculovirus expression 
vector is subsequently packaged into an infectious 

20 recombinant baculovirus. Homologous recombination occurs 

at low frequency (between about 1% and about 5%); thus, 
the majority of the virus produced after cotransfection 
is still wild- type virus. Therefore, a method is 
necessary to identify recombinant viruses. The beauty of 

25 the expression system is a visual screen allowing 

recombinant viruses to be distinguished. The polyhedrin 
protein, which is produced by the native virus, is 
produced at very high levels in the nuclei of infected 
cells at late times after viral infection. Accumulated 

30 polyhedrin protein forms occlusion bodies that also 

contain embedded particles. These occlusion bodies, up 
to 15 Mm in size, are highly refractile, giving them a 
bright shiny appearance that is readily visualized under 
the light microscope. Cells infected with recombinant 

35 viruses lack occlusion bodies. To distinguish 
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recombinant virus from wild-type virus, the transfection 
supernatant is plagued onto a monolayer of insect cells 
by techniques known to those skilled in the art. Namely, 
the plaques are screened under the light microscope for 
5 the presence (indicative of wild-type virus) or absence 

(indicative of recombinant virus) of occlusion bodies 
["Current Protocols in Microbiology", Vol. 2 (Ausubel et 
al. eds) at 16.8 (Supp. 10, 1990); Summers and Smith, 
cited above; Miller et al. (1989), cited above]. 

10 Recombinant baculovirus expression vectors 

have been developed for infection into several insect 
cells. For example, recombinant baculoviruses have been 
developed for, inter alia: Aedes aeoypti , Autoarapha 
calif ornica , Bombvx mori . Drosophila melanoaaster ^ 

15 Spodoptera fruaiperda , and Trichoplusia ni rP>C.T. Pub. 

No. WO89/046699; Carbonell et al., J. Virol. 56:153 
(1985); Wright, Nature 121:718 (1986); Smith et al., Mol. 
Cell. Biol. 3:2156 (1983); and see generally, Fraser et 
al.. In Vitro Cell. Dev. Biol. 25:225 (1989)]. 

20 Cells and cell culture media are 

commercially available for both direct and fusion 
expression of heterologous polypeptides in a 
baculovirus/expression system. Cell culture technology 
is generally known to those skilled in the art [see, 

25 e.g.. Summers and Smith, cited above]. 

The modified insect cells may then be grown 
in an appropriate nutrient medium, which allows for 
stable maintenance of the plasmid(s) present in the 
modified insect host. Where the expression product gene 

30 is under inducible control, the host may be grown to high 

density, and expression induced. Alternatively, where 
expression is constitutive, the product will be 
continuously expressed into the medium and the nutrient 
medium must be continuously circulated, while removing 

3 5 the product of interest and augmenting depleted 
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nutrients. The product may be purified by known 
techniques, such as, chromatography (e.g., HPLC, affinity 
chromatography, ion exchange chromatography) , 
electrophoresis, density gradient centrif ugation, solvent 
5 extraction, or the like. As appropriate, the product may 

be further purified, as required, to remove substantially 
any insect proteins which are also secreted in the medium 
or result from lysis of insect cells, to provide a 
product which is at least substantially free of host 
10 debris, e.g., proteins, lipids and polysaccharides. 

D. Deposit of biological material 

Escherichia coli strain HBlOl host cells 
transformed with a plasmid containing the PACE gene of 

15 Fig. 2, PACE/pBS24.1 have been deposited on November 30, 

1990, with the American Type Culture Collection (ATCC) , 
Rockville, MD, and designated as PACE/pBS24.1 in E. coli. 
This deposit will be maintained under the terms of the 
Budapest Treaty on the International Recognition of the 

20 Deposit of Micro-org8misms for purposes of patent 

procedure. The accession number is ATCC 68486. 

This deposit is provided merely as 
convenience to those of skill in the art, and is not an 
admission that a deposit is required under 35 U.S.C. 

25 §112. The nucleic acid sequence of this plasmid, as well 

as the amino acid sequence of the polypeptide encoded 
thereby, are incorporated herein by reference and are 
controlling in the event of any conflict with the 
description herein. A license may be required to make, 

30 use, or sell the deposited material, and no such license 

is hereby granted. 

The following experimental section is 
intended to be merely illustrative and does not limit the 
present scope in any way. The following examples 

35 illustratively describe the construction of plasmids for 
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the expression and production of PACE in mammalian cells / 
and the co-expression of PACE and the blood coagulation 
factor, Factor IX, in mammalian cells. 

Example 1 - Construction of PACE cDNA 

This example demonstrates the construction 
of a composite recombinant cDNA which encodes mammalian 
PACE, and the characterization of the polypeptide encoded 
therein. The cDNA was constructed from two isolated 
cDNAs encoding separate portions of the PACE molecule. 

The molecular cloning of cDNAs encoding PACE 
was accomplished as follows. An oriented cDNA library 
was constructed in the yeast expression vector pAB23BXN 
using poly (A)**" mRNA isolated from the human liver cell 
line HEPG2. pAB23BXN is a derivative of pAB23BX [D. 
Schild et al., Proc. Natl. Acad. Sci. U.S.A. . 87:2916 
(1990)] into which a synthetic polylinker, that contained 
Bst XI and Not 1 sites, was inserted for unidirectional 
cDNA cloning. Oligonucleotide probes were used to 
isolate a 3,295 bp clone from the library. These probes 
were synthesized using the sequence of a partial cDNA 
clone (3.1 kb) which putatively encodes a portion of the 
fur gene product [A.J.M. Roebroek et al., EMBO J . , 5:2197 
(1986) ] . 

In order to isolate the 5 ' -end of the PACE 
cDNA, a second cDNA library from HEPG2 poly (A) RNA mRNA 
was constructed in XZAPII [Stratagene] , using specific 
internally primed message. Using the longest clone 
isolated from this library, a composite cDNA for PACE was 
constructed. The composite cDNA contains 4,351 bp and is 
comprised of 388 bp of 5 '-untranslated region, a putative 
coding sequence corresponding to 794 amino acids, and 
1597 bp of 3 ' -untranslated region, including two 
termination codons and a tail of 17 dA residues. 
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The full sequence of the composite PACE cDNA 
and the encoded protein sequence is shown in Figure 2 
with the encoded protein sequence shown above that of the 
cDNA sequence. The numbering is based on the significant 
5 open reading frame (ORF) in the cDNA. Oligonucleotide 

adaptor sequences present in the cDNA are indicated by 
lower-case letters. The putative signal peptide is 
indicated by underlining and the transmembrane domain 
(TM) by shading. Likely active site residues are 

10 indicated by asterisks. Consensus sites for Asn-linked 

glycosylation are marked by diamonds and cysteine 
residues by bars. Potential dibasic proteolytic 
processing sites are indicated by arrows. 

Based upon the composite PACE cDNA 

15 structure, the following is deduced. The translation of 

PACE is probably initiated at the ATG start codon at 
nucleotide #1. Although there are four ATG codons 
upstream from nucleotide #1, the ATG at nucleotide #1 is 
the only in-frame methionine codon in the 5 '-region of 

20 the cDNA, and the subsequent 26 amino acids constitute a 

classical hydrophobic signal sequence, which is usually 
associated with a membrane-bound protein. The signal 
peptidase cleavage site occurs between amino acids #26- 
27. 

25 A large ORF encodes a PACE precursor protein 

with a calculated molecular weight of 8 6.7 kD. In 
addition, several paired basic amino acid residues are 
located in the amino-terminal region of the PACE 
precursor (Figure 2), and could represent 

30 proteolytic /autolytic processing sites. The coding 

sequence contains three consensus sites for N-linked 
glycosylation and twenty-two cysteine residues. The 
active site is in the ORF and includes a triad of amino 
acids: aspartic acid (Asp #153), histidine (His #194), 

35 and serine (Ser #368). A cysteine-rich region (CRR) is 
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also present and, as shown in Figure 2, is located in the 
vicinity of amino acid Cys #587 to amino acid Cys #675, 
A putative hydrophobic transmembrane domain (TM) is 
located downstream from the cysteine-rich region, at 
5 approximately amino acid Val #716 to amino acid Leu #738, 

The 3 ' -untranslated region is relatively 
long (1597 bp) and contains a possible polyadenylation 
signal (ATTAAA) at nucleotides #3939-3943 of the 
composite clone. Of particular note are numerous regions 
10 of extensive potential secondary structure involving 

coding sequences, and the 3 ' -untranslated sequences 
around the termination codon. 



Example 2 - Plasmid Construction and Expression of PACE 

15 cDNA in Mammalian COS-1 Cells 

This example demonstrates the expression of 
recombinant PACE cDNA in COS-1 cells. The mammalian cell 
expression system was constructed as follows. 

A truncated 2.47 kbp PACE cDNA fragment is 

2 0 employed, which was generated from the composite PACE 

cDNA by PCR. The method utilized synthetic primers which 
hybridized to the 5 '-end of the PACE coding sequence and 
to approximately 70 bp into the 3 • -untranslated region. 
The 5' primer generated an EcoRI site for cloning into 

25 pBluescript SK" [ Stratagene] . The 3' primer generated a 

Sail cloning site. All of the PCR products were verified 
by the M13 dideoxy sequencing method. 

The 2.47 kbp (EcoRI-Sall) PACE cDNA fragment 
from pBluescript-PACE included the 794 codon PACE coding 

30 sequence (Figure 1) and 74 bases of 3 ' -untranslated 

sequence before a Sail site [van den Ouweland et al, 
cited above]. At the 5 '-end, using the EcoRI PCR primer, 
the sequence immediately preceding the ATG was modified 
to conform to the consensus translation start site. 



RIJBSTITUTF SHEET 



wo 92/09698 



PCr/US91/08725 



56 

The 2.47 kb truncated cDNA was inserted into 
the cloning site (EcoRI-Sall) of the SV40-based 
expression vector pMT3 to generate the plasmid pMT3-PACE. 
The pMT3 vector is a derivative of the vector pMT2 [R-J. 
5 Kaufman et al. , Mm, Cell, Biol.. 9:946 (1989)] in which 

the DHFR coding region on the 3 '-side of the cloning site 
has been removed. pMT3 has been deposited with the 
American Type Culture Collection (ATCC) , Rockville, MD 
(USA) under Accession Number ATCC 40348. pMT3 can also 

10 be generated starting with pMT2-vWF, which is deposited 

at the ATCC under Accession Number ATCC #67122 [see PCT 
application PCT/DS87/00033] . 

DNA of the resulting vector, pMT3-PACE, was 
purified and introduced for transient expression into 

15 SV40-transformed monkey kidney cells (COS-1) using a 

calcium phosphate transfection protocol as described in 
Chen, C. A., and Okayama, H. , BioTechniques, 6:632-638 
(1988) ; and C. Chen and H. Okayama, Mol. Cell. Biol. 
7:745 (1987). Cells were transfected with 40 Mg of 

20 plasmid per 10 cm dish in 10 mis of medium or, in the 

case of co-transfections, an equimolar ratio of plasmids 
totalling 60 fig per 10 cm dish in 10 ml of medium. 

To monitor PACE synthesis, pMT3-PACE 
transfected COS-1 cells were radiolabeled 48-60 hours 

25 following transfection using S-labeled amino acids, e.g., 

^^S-Met and ^^S-Cys, in medium lacking those amino acids, 
e.g., Cys and Met. Untransf ected cells were similarly 
treated. After a 30 minute pulse period, cell extracts 
were prepared by lysis in NP-40 lysis buffer [A.J. Dorner 

30 and R.J. Kaufman (1990), Meth. Enzvmol.. 185:577 (1990)] 

or were chased by removing the labeling medium and 
replacing it with complete medium for additional 
incubation. Cell extracts and conditioned medium were 
treated with protease inhibitors and immunoprecipitated 
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using the method described in Wise et al, Cell , ^:229- 
236 (1988). 

Immunoprecipitates were performed with 
rabbit anti-PACE antiserum produced against a PACE-£^ 
5 coli fusion protein. Rabbit anti-PACE antiserum was 

generated against the catalytic domain of PACE by 
expression of amino acids 146 to 372 of PACE as a human 
superoxide dismutase (SOD) fusion protein in E, coli > 
The DNA fragment for expression was generated by 

10 polymerase chain reaction (PCR) and cloned into the 

superoxide dismutase (SOD) fusion vector pTAC7 [Steimer 
et al, J. Virol, . 58; 9 (1986)]. The induced fusion 
protein was purified by preparative polyacrylamide gel 
electrophoresis, eluted and used to immunize rabbits in 

15 complete Freunds adjuvant. 

The immunoprecipitated samples were then 
analyzed by SDS-polyacrylamide gel electrophoresis [SDS- 
PAGE; (A 8%; B,C 6% acrylamide) ] . The gels were prepared 
for f luorography in EnHance [Dupont] . 

20 In the lysates from the control COS-1 cells 

which were not transfected with pMT3-PACE, immunoreactive 
proteins with anti-PACE antiserum were not detected. 
However, in extracts from pMT3-PACE transfected cells, 
immunoreactive species were detected that migrated in the 

25 gels primarily as a doublet of approximately 90 kD. 

These PACE immunoprecipitates were treated with the 
endoglycosidase enzyme, N-glycanase [Genzyme] , using the 
method described in A.J, Dorner and R.J- Kaufman (1990) , 
cited above. This treatment resulted in a shift in the 

30 electrophoretic mobility of the labeled proteins in the 

gels which was consistent with the presence of 
asparagine-linked oligosaccharides. However, these 
digestions did not fully reduce the complexity of the 
bands, suggesting that differential glycosylation may not 
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be the source of the observed heterogeneity in the 
expressed FACE. 

In order to analyze secretion of PACE, the 
35s-labeled cells were incubated for a 12 hour chase 
period in a medium containing an excess of unlabeled 
amino acids. The secreted products from the conditioned 
medium and in cell lysates were immunoprecipitated with 
the anti-PACE antiserum. The medium from the pMT3-PACE 
transfected cells yielded an immunoreactive protein which 
migrated in the gels as a 75 kD polypeptide. The 
relative quantity of the 75 kD immunoprecipitated PACE 
polypeptide observed in the conditioned medium was 5 to 
10 fold less than that detected in the cell lysate or 
remaining inside the cell at the 12 hour chase period. 

This secreted PACE species, which differs in 
apparent size from the intracellular species, may 
represent a truncated molecule which is missing its 
transmembrane and/or intracellular domains. This 
difference in size may possibly be the result of auto- 
proteolysis at the paired arginine residues, #497-498, 
due to the large overproduction of PACE in the 
transfected COS-1 cells. 

More extensive pulse-chase experiments 
demonstrated that the PACE translation product does not 
accumulate to high levels inside the cell compared to 
another integral membrane glycoprotein (influenza 
hemagglutinin) when synthesized at similar levels. 

Example 3 - Coexoression of PACE and vWF in Cos-1 Cells 

This example demonstrates the effect of 
recombinant PACE expression on the processing of von 
Willebrand factor (vWF) , a protein involved in blood 
coagulation, produced during co-expression of the two 
recombinant polypeptides in COS-1 cells. 
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vWF is a multimeric plasma protein which is 
normally synthesized in endothelial cells as a large 
precursor polypeptide (prepro-vWF) . Upon translocation 
into the endoplasmic reticulum (ER) , the precursor 
polypeptide undergoes signal peptide cleavage and N- 
linked oligosaccharide addition. In the ER, pro-vWF 
forms carboxy-terminal linked disulf ide-bonded dimers 
that, upon transport to the Golgi and post-Golgi 
compartments, undergo a complex series of processing 
steps. These steps include: processing of N-linked 
carbohydrate, 0-linked glycosylation, assembly of 
disulfide linked multimers, and propeptide cleavage [R.I. 
Handin and D.D. Wagner, in Progress in Hemostasis and 
Thrombosis , vol 9, B.S. Coller, Ed. (W.B. Saunders, 
Philadelphia, 1989) pp. 233-259]. 

In endothelial cells, vWF follows both a 
constitutive and regulated pathway of secretion. 
Transfection of a vWF cDNA expression vector into COS-1 
cells directs the synthesis of prepro-vWF [D.T. Bonthron 
et al., Nature . 324 :270 (1986)]. However, although COS-1 
cells do possess a protease capable of recognizing and 
cleaving the vWF propeptide, this process is inefficient. 
Thus, approximately 50% of the secreted protein from a 
typical expression study is uncleaved pro-vWF [R.J. Wise 
et al., Cell , 52:229 (1988)]. If PACE recognizes and 
cleaves the vWF propeptide, then co-expression of PACE 
with Pro-vWF should result in greater conversion of pro* 
vWF to the mature form. 

In order to demonstrate PACE conversion of 
pro-vWF to the mature form, COS-1 cells were transfected 
with either pMT3-PACE, pMT2-vWF [D.T. Bonrthron et al.. 
Nature . 324:270 (1986)], or cotransf ected with both 
plasmids. Cells were transfected with 40 /xg of plasmid, 
or in the case of co-transf ections with an equimolar 
ratio of plasmids totaling 60 per 10 cm dish in 10 ml 
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of medium. The transfected cells were pulse-labeled with 
3^S-amino acids for 30 minutes and lysed, as described in 
Example 2> or were chased by removing the labeling medium 
and replacing it with complete medium for additional 
incubation. 

Cell extracts and conditioned medium samples 
were treated with protease inhibitors and immuno- 
precipitated. Immunoprecipitation was with an anti-vWF 
polyclonal antibody [Dako Corp.] which specifically 
recognizes the mature portion of vWF. The same samples 
were also immunoprecipitated with a monoclonal antibody 
specific for the propeptide of vWF (anti-vWAgll) . 

Immunoprecipitation of cell extracts from 3 0 
minute pulse-labeled cells with anti-VWF antibody 
detected only single chain pro-vWF precursor in COS-1 
cells transfected with pMT2-vWF alone. The conditioned 
medium yielded both cleaved (mature) and uncleaved (pro- 
vWF) forms in nearly equal amounts. 

In contrast, in cellular extracts of COS-1 
cells that were co-transf ected with pMT2-vWF and pMT3- 
PACE, the 100 kD propeptide and 225 kD mature subunit 
were detected at the 30 minute pulse time point. This 
indicates that there was a significant amount of 
propeptide cleavage at this time point. In the 
conditioned medium, following a 12 hour chase period, the 
secreted vWF was completely processed to the 225 kD 
mature protein. Analysis of the amino-terminus of ^^S-Met 
labeled 225 kD product by 21 cycles of automated Edmaji 
degradation, followed by scintillation counting, yielded 
results which were consistent with cleavage at the 
correct site within the vWF precxirsor. 

Cleavage of pro-vWF to the mature form of 
vWF also yields the vWF propeptide. The production of 
this propeptide in the above studies was also monitored. 
The presence of this propeptide was shown by 
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immunoprecipitation with a monoclonal antibody directed 
against the propeptide, also known as vWF Antigen II 
[P.J. Fay et al., Nature, 232 ; 995 (1986)]. Analysis of 
the immunoprecipitated products was by polyacrylamide gel 
electrophoresis, as described above. 

The results showed that imxnunoprecipitates 
from extracts of cells transfected with pMT2-vWF alone 
yielded unprocessed pro-vWF (due to the presence of the 
uncleaved propeptide in the precursor molecule) . 
Immunoprecipitates of extracts from cells co-transf ected 
with pMT2-vWF and pMT3-PACE yielded the vWF propeptide, 
which migrated in the gels as a doublet at 100 kD. The 
doublet was reduced to a single species after digestion 
with N-glycanase, indicating that the apparent difference 
in molecular weights was due to differential 
glycosylation. 

Using a similar analysis, the conditioned 
cell media were also analyzed for the presence of 
propeptide. Immunoprecipitates of the conditioned medium 
of the pMT2-vWF transfected cells yielded the free 
propeptide and multimers of vWF. The multimers contained 
a mixture of mature vWF and pro-vWF, indicating 
incomplete processing in the singly transfected COS-1 
cells. However, the anti-Agll antibody immuno- 
precipitates from the conditioned medium from co- 
transf ected cells yielded only free propeptide, 
indicating that the pro-vWF had been totally converted 
into the mature form. 

In these studies with the detection of the 
propeptide, formation of vWF multimers in the media from 
singly transfected and co-transf ected cells was confirmed 
by non-reducing agarose gel electrophoresis, using 
essentially the technique described by R.J. Wise et al., 
Cell . 52:229 (1988). The agarose gel electrophoresis 
analysis indicated that the amount of vWF multimers in 
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the media from the singly and co-transformed cells was 
comparable . 

Example 4 - Substrate Specificity of PACE 

In order to test the recognition specificity 
of the recombinant PACE for substrates with a Lys-Arg or 
Lys-Lys cleavage site, studies were performed with 
mutants in the cleavage site of pro-vWF. One of the 
mutants, designated vWF DES, contained a non-conservative 
substitution, Lys-Arg-Ser (KRS) to Asp-Glu-Ser (DES) , at 
the propeptide cleavage site. The other mutant, 
designated vWF KKS, contained a conservative substitution 
of Lys-Lys-Ser for Lys-Arg-Ser at the propeptide cleavage 
site. 

Plasmids containing the mutant vWF genes 
were co-transfected with pMT3-PACE to determine the 
susceptibility of their expression products to cleavage 
with PACE. The analysis was carried out as described in 
Example 3 above. 

The results of the analysis showed that when 
COS-1 cells were transfected with the plasmid encoding 
vWF DES, the labeled product was secreted as an uncleaved 
pro-vWF species. The same results were obtained with 
COS-1 cells which were co-transformed with both the vWF 
DES plasmid and with pMT3-PACE. When the expression 
products of COS-1 cells transfected with the plasmid 
encoding vWF KKS were examined, the labeled product was 
again secreted as an uncleaved pro-vWF species. When the 
expression products of the co-transformants which 
expressed both PACE and the KKS mutant protein were 
examined, although some of the secreted vWF remained 
uncleaved, a significant amount of propeptide cleavage 
had occurred. 

The results of these studies with the 
mutated vWF sequences indicates that a non-conservative 
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substitution at the natural Lys-Arg cleavage site of pro- 
vWF prevents cleavage by co-expressed recombinant PACE, 
However, a conservative substitution of Lys-Lys for Lys- 
Arg still allows an acceptable substrate for the 
recombinant protease. 

Example 5 - Expression of PACE in CHO Cells 

This example illustrates the transformation 
of Chinese hamster ovary (CHO) cells with the PACE coding 
sequence. Suitable vectors were constructed as follows. 
pMT3-PACE was digested with Sail to linearize at the 3' 
end of PACE cDNA. The Sail site was filled-in with dNTPs 
and Klenow. The EcoRI linker was ligated to a blunt end 
and then digested with EcoRI. PACE cDNA was isolated on 
a gel and then ligated to EcoRI-linearized pMT2-EMC-DHFR. 
This latter plasmid is a minor derivative of pED4, 
described in R. Kaufman et al, Nucl, Acids Res. . 
19 (16) :4485-4490 (1991). 

Transformed DH5a colonies were picked for 
plasmid miniprep. Insert orientation was determined with 
Kpnl, BamHI, Bglll. The properly oriented clone was 
grown for large-scale plasmid preparation. The remainder 
of the miniprep DNA was used to transfect two CHO cell 
lines. 

A lipofection kit [BRL] was used to 
transfect CHO cells on 60 mm culture dishes in OptiMEM 
medium. The two starting cell lines were CHO-DUKX and 
PM5F-0.1, which is a VWF-producing line derived from PM5F 
by selection for resistance of O.l^M DCF. 

a-selection was started after splitting the 
cells to 100mm plates. The CHO-DUKX line was selected in 
a-MEM/10% dialysed fetal calf serum (PCS). The PM5F line 
was selected in a-MEM-AAU/10% dialysed FCS. Both lines 
showed good growth during 3 days of a a-medium selection. 
These a-selected cells were split. One plate of each 
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line (called PACE-DUKX-a and PM5F-PACE-a) was passaged in 
a-medium for 10 days then frozen for storage. 

Methotrexate (MTX) was added (0.05 fifi) to 
the selection medium four days later. Many colonies 
5 formed over approximately 1 week. These colonies were 

pooled and split for selection in methotrexate at 0.1 fiK. 
about a week later. Again, many colonies formed which 
were pooled, split and continued in selection medium with 
0.1 ^M methotrexate. These amplified pools were then 

10 frozen for storage. 

PM5F-PACE ("pool A") cells were pulse- 
labeled. Two subconfluent 100mm plates were rinsed in 
serum-free medium. 1 ml of Cys/Met deficient medium 
supplemented with 250 /iCi each of 35-S Met and 35-S Cys 

15 was added for a 15 minute pulse. One plate was lysed for 

immunoprecipitation of cell extract. Medium was removed 
from the other plate and 2 ml complete medium (serum- 
free) added for a 12 hour chase. At 12 hours, 
conditioned medium was collected and cells were lysed for 

20 immunoprecipitation. Cell lysis was in 1 ml of cold 0.5% 

Triton-X-100, M NaCl, 10 Mm Tris-HCl (pH 7.5), 5 Mm Na2- 
EDTA. Protease inhibitors were added to conditioned 
medium and cell extract. Immunoprecipitates of 0.5 ml of 
cell extract and 1 ml conditioned medium were performed 

25 with an anti-vWF antibody [DAKO] coupled two Affi-Gel and 

an anti-PACE antiserum [Chiron] secondarily bound to 
protein-A sepharose. 

Precipitates were washed in cold lysis 
buffer and analysed on SDS-PAGE. Results were similar to 

3 0 that seen in PACE plus vWF COS-1 co-transfection 

experiments. With anti-PACE, a 95-100 kDA doublet band 
was precipitated in the 15 minute cell extract. At 12 
hours, the intensity of this cell extract band was 
reduced approximately 10 fold. In the conditioned 

35 medium, at 12 hours, a 75-80 kDa single band was 
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detected. With the anti-vWF, it was determined that the 
secreted vWF at 12 hours was completely processed mature 
vWF. In the cell extract samples, both pro-vWF and 
cleaved vWF were present. 

These findings differ from that observed in 
the parent cell line, PM5F, in that secreted vWF is only 
partially processed and intracellular cleavage is 
minimal. For PM5F-PACE, a comparison of the 
autoradiographic intensities of the PACE bands and the 
vWF bands indicated that the level PACE expression is 
roughly 1/2 that of vWF. 

The PACE-DUKX ("pool 4/4") was tested in the 
manner described above. The SDS-PAGE results from anti- 
PACE immunoprecipitates demonstrated an intracellular 95- 
100 kDa doublet band in pulsed (3 0 minute) cell extract 
and the apparent secretion of a smaller (75-80 kDa) 
immunoreactive species in the chased (18 hour) 
conditioned medium. In addition, in this labeling 
experiment, PM5F-PACE cells were analyzed for comparison. 
The intensities of the PACE bands in the 30 minute cell 
extract immunoprecipitates were equal for both cell 
lines. 

Example 6 - Co-Expression of PACE and Factor IX in CHO 
Cells 

A CHO cell line producing recombinant 
Factor IX (IC4) [the IC4 cell line is described in 
Kaufman et al, J. Biol. Chem, . 261 :9622-9628 (1986)] and 
Factor IX sequences were transfected with the PACE cDNA 
described above in Example 1 operably linked to another 
amplifiable marker, adenosine deaminase. The vector 
MT3SV2Ada [R.J. Kaufman et al, Meth. Enzvm, , 185:537-566 
(1990)] was chosen for PACE expression because it 
contains a selectable ADA transcription unit but no DHFR 
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sequences and the PACE fragment could easily be inserted 
after digestion of the vector with EcoRl and Sail. 

A vector fragment was isolated from low melt 
agarose, ligated in a ratio of 5:1 (fragment to. vector) , 
5 diluted in 10 mM Tris-HCl, pH 7.5, 1 mM EDTA, and used to 

transform DH5 bacteria [Dr. Douglas Hanahan, Cold Spring 
Harbor, New York]. A nick-translated, ^^p labelled PACE 
fragment was prepared and used for filter hybridization 
to screen transformed colonies. 

10 Positively hybridizing colonies were 

isolated and DNA prepared for digestion with EcoRl and 
Sail for confirmation of PACE insertion and with Bgl II 
for correct orientation of the fragment with respect to 
adenovirus major late promoter in the vector. 

15 DNA from one colony was isolated for 

electroporation into the Factor IX producing cells, IC4. 
Pools of colonies have been selected for amplification by 
growth in 1.0 fM 2 • -deoxycof ormycin (DCF) . The presence 
of PACE in these amplified lines was confirmed by ^^S- 

20 methionine labelling and immunoprecipitation. 

Biological activity of the Factor IX protein 
in the PACE/ IX pools was analyzed by clotting assay, 
performed as described in Kaufman et al, J. Biol> Chem., 
261 :9622-9628 (1986). Cells were plated in p60 tissue 

25 culture dishes. The next day medium was reduced (1.5 ml) 

and changed to a "defined" + 1 ^q/val Vitamin K3. 

The PACE/ Factor IX pools were found to 
secrete between 2.0 and 3.1 fold more Factor IX 
biological activity than the original IC4 cell line. The 

3 0 results of a radioimmtmoassay indicated increased levels 

of 7-carboxylated protein. These results are illustrated 
in Table I below. 
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TABLE I 

Factor IX Assays in original IC4 and 
PACE Co-azprsssing Cell Lines 

CLOTTING ASSAY CLOTTING ASSAY RIA 
U/ml (pg/cell) U/ml (pg/cell) 

Cell GLA TOTAL GLA 

Mg/mL /xg/mL TOTAL 
(pg/cell) 



IC4 .28 (.32) 


.18 


(.18) 




.1 


20 


(30) 


.5% 


Co-expressors 
















0.1 ziM DCF 




0 uM 


DCF 




5 


aM DCF 




A .72 (.89) 2.7x 


.45 


(.48) 


2 . 6x 


.69 


20 


(29) 


3.4% 


B .53 (-76) 2.3X 


.39 


(.41) 


2.3x 


1.05 


22 


(27) 


4.8% 


C .66 (.73) 2.2X 


.35 


(.41) 


2.3x 


.17 


19 


(54) 


.8% 


D .46 (.66) 2. Ox 


.55 


(.55) 


3. IX 


1.14 


17 


(24) 


6.7% 


E .67 (.80) 2.5X 


.49 


(.52) 


2.9X 


.3 


11 


(34) 


2.7% 



From the first electroporation of MT3-PACE 
Ada into IC4 cells, cells were selected in a medium with 
10% dialyzed fetal calf seriam, penicillin, streptomycin, 
glutamine, 2 00 /zM Methotrexate and Adenosine, alanosine, 
uridine and O.l/xM DCF. Approximately 2 5 colonies were 
observed in plates that did not receive DNA. 

A second electroporation performed was 
selected in the same manner and approximately 100 
colonies were pooled into each of the 5 pools. Again, no 
colonies were observed on plates that did not receive 
DNA. 

Expression of PACE was detected in each pool 
by 3 0 minute pulse with ^^S Methionine followed by 2 hour 
chase and immunoprecipitation of cell extracts with a 
PACE antibody [Chiron Corporation, California]. in cells 
which express higher levels of PACE as a result of 
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selection for further DCF resistance, secretion up to 10- 
fold greater levels of y-carboxylated Factor IX was 
observed compared to the original IC4 cell line. 

The coexpression of PACE did not produce any 
5 detectable change in the size of the Factor IX protein as 

monitored by immunoprecipitation with a FIX antibody 
[Hybridtech] and SDS gel electrophoresis. 

Bvam ple 7 - Baculoviru s Expression of PACE 
20 Two baculovirus expression cassettes were 

constructed for expression of PACE in insect cells. 
Cassette I was constructed using as the PCR template, 
PACE/pBS24.1, with primers fur 102 and fur 103: 
102: 5 'CCA CCT GTC TGA TCA ATG GAG CTG AGG CCC TGG TTG3 ' 
15 103: 5 'GAG GCC TGA TCA CTA CTC AGC CAG GTG TGA GGG CAT3 ' . 

The cassette was made without a transmembrane domain. 
The pCR product was extracted with phenol/chloroform and 
precipitated with ethanol. The PCR product was then cut 
with Bell and ligated to the pAC373 vector, which was cut 
20 with Bam HI and phosphatased. Cassette II was 

constructed using as the PCR template, PACE/pBS24. 1, with 
primers fur 102 (above) and fur 104: 

104 : 5 • GCA GCC TGA TCA CTA TGG AGG TAC GGG CAG CCC CTC3 • . 

The pCR product was purified and cloned into pAC373 by 
25 the procedure described above for Construct I. 

Numerous modifications and variations of the 

present invention are included in the above-identified 

specification and are expected to be obvious to one of 

skill in the art. Such modifications and alterations to 
30 the compositions and processes of the present invention 

are believed to be encompassed in the scope of the claims 

appended hereto. 



SUBSTITUTE SHEET 



wo 92/09698 



PCT/US91/08725 



69 

WHAT IS CLAIMED IS: 

1. A host cell comprising a recombinant 
polynucleotide encoding PACE, wherein the cell is capable 
of expressing PACE. 

2 . The host cell according to claim 1 
wherein the recombinant polynucleotide is the cDNA 
sequence of Figure 1 or 2* 

3. The host cell according to claim 1, 
wherein the encoded PACE lacks a transmembrane domain. 

4. The host cell according to claim 1 
wherein said PACE encoding sequence is operably linked to 
a heterologous expression control sequence permitting 
expression of the PACE polynucleotide sequence. 

5. The host cell according to claim 1 
further comprising a polynucleotide encoding a precursor 
polypeptide, wherein the precursor polypeptide is a 
substrate for the encoded PACE, and wherein the cell is 
capable of expressing the polynucleotides encoding PACE 
and the heterologous polypeptide. 

6. The host cell according to claim 5 
wherein said precursor encoding sequence is operably 
linked to a heterologous expression control sequence 
permitting expression of the protein product of the 
precursor polynucleotide. 

7. The host cell according to claim 5 
wherein the encoded PACE and the encoded heterologous 
precursor polypeptide, when expressed, are secreted into 
extracellular medium. 
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8. The host cell according to claim 5 
wherein said precursor is a precursor of a protein which 
requires gamma-carboxylation for biological activity. 

9. The host cell according to claim 8 
wherein said precxirsor is a precursor polypeptide of a 
blood coagulation protein. 

10. The host cell according to claim 9 
wherein said protein is selected from the group 
consisting of Factor IX, Protein C, Protein S, 
Prothrombin Factor 10, Factor VII and bone gamma- 
carboxyglutamate protein. 

11. The host cell according to claim 10 
wherein said protein is Factor IX. 

12. The host cell according to claim 5 
wherein said precurser is vWF. 

13. The host cell according to claim 1, 
which is an eukaryotic cell. 

14 . The host cell according to claim 13 is 
a mammalian cell- 

15. The host cell according to claim 14, 
wherein the mammalian cell is a CHO cell. 

16. The host cell according to claim 1, 
which is an insect cell. 

17. The host cell according to claim 1, 
which is a microorganism cell. 
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18. The host cell according to claim 17 
selected from the group consisting of bacterial cells and 
fungal cells. 

19. The host cell according to claim 18 
which is a yeast cell. 

20. A recombinant expression vector 
suitable for expression in a selected host cell 
comprising a polynucleotide sequence encoding PACE, 
wherein the PACE encoding sequence is operably linked to 
an expression control sequence permitting expression of 
the polynucleotide sequence. 

21. The vector according to claim 2 0 
further comprising a polynucleotide sequence encoding a 
precursor protein which is a substrate for PACE, operably 
linked to an expression control sequence permitting 
expression of the polynucleotide sequence. 

22. The vector according to claim 21 
wherein said precursor is a precursor polypeptide of a 
protein requiring 7-carboxylation for biological 
activity, 

23. The vector according to claim 21 
wherein said precursor is a precursor polypeptide of a 
blood coagulation protein. 

24. The vector according to claim 23 
wherein said protein is selected from the group 
consisting of Factor IX, Protein C, Protein S, 
Prothrombin Factor 10, Factor VII and bone gamma- 
carboxyglutamate protein. 
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25. The vector according to claim 24 
wherein said protein is Factor IX, 

26. The vector according to claim 21 
wherein said precursor is a precursor polypeptide of vWF. 

27 • The vector according to claim 20, 
wherein said host cell is an eukaryotic cell. 

28. The vector according to claim 27 
wherein said host cell is a mammalian cell. 

29. The vector according to claim 28, 
wherein the mammalian cell is a CHO cell. 

30. The vector according to claim 20, 
wherein said host cell is an insect cell. 

31. The vector according to claim 20, 
wherein said host cell is a microorganism cell. 

32. The vector cell according to claim 31, 
wherein said microorganism cell is selected from the 
group consisting of bacterial cells and fungal cells. 

33. The vector cell according to claim 32 
wherein said fungal cell is a yeast cell. 

34. A method for producing recombinant 
PACE comprising incubating a host cell comprising a 
recombinant polynucleotide encoding PACE under conditions 
that allow expression of the PACE protein. 
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35. The method according to claim 34 
wherein said host cell further comprises a polynucleotide 
encoding a precursor polypeptide, wherein the precursor 
polypeptide is a substrate for the encoded PACE, and said 
conditions allow expression of both said PACE 
polynucleotide and said precursor polynucleotide. 

36. The method according to claim 34 
comprising the steps of: 

(a) providing a selected host cell 
transformed by a recombinant expression vector suitable 
for expression in the host cell comprising a 
polynucleotide sequence encoding PACE, wherein the PACE 
encoding sequence is operably linked to an expression 
control sequence permitting expression of the 
polynucleotide sequence, wherein said expression control 
sequence is compatible with the selected host cell; and 

(b) incubating the host cell under 
conditions that allow transformation with the expression 
vector. 

37. The method according to claim 34 
wherein said host cell is a mammalian cell. 

38. The method according to claim 34 
wherein said host cell is a microorganism cell. 
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39. A method of producing a recombinant 

PACE, comprising: 

(a) providing an insect cell infected with 
an expression vector comprising a polynucleotide sequence 
encoding PACE, wherein the PACE encoding sequence is 
operably linked to an expression control sequence 
permitting expression of the polynucleotide sequence, 
which expression control sequence is compatible with an 
insect cell; and 

(b) incubating the insect cell under 
conditions which allow infection of the vector. 

40- A method for producing a desired mature 
polypeptide comprising incubating a host cell comprising 
a polynucleotide encoding PACE and a polynucleotide 
encoding a precursor polypeptide, wherein the precursor 
polypeptide is a sxibstrate for the encoded PACE, under 
conditions that allow expression of both recombinant PACE 
and the precursor polypeptide, 

41. The method according to claim 4 0 
wherein said conditions permit the secretion of both 
recombinant PACE and said precursor polypeptide. 

42. The method according to claim 40 
wherein said conditions allow the cleavage of the 
heterologous precursor polypeptide by the expressed 
recombinant PACE. 
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43. A method of increasing the yield of a 
biologically active protein comprising culturing a host 
cell comprising a nucleotide sequence encoding PACE 
operably linked to a heterologous expression control 
sequence permitting expression of the nucleotide sequence 
and a nucleotide sequence encoding the precursor of a the 
precursor of said protein, operably linked to a 
heterologous expression control sequence permitting 
expression of said protein. 

44. The method according to claim 43 
wherein said PACE DNA sequence is present on one vector 
and said precursor DNA sequence is present on a second 
vector . 

45. The method according to claim 43 
wherein said PACE DNA and said precursor DNA are present 
on a single vector. 

46. The method according to claim 4 3 
wherein said precursor is a precursor polypeptide of a 
blood coagulation protein. 

47. The method according to claim 43 
wherein said protein is selected from the group 
consisting of Factor IX , Protein C, Protein S, 
Prothrombin Factor 10, Factor VII and bone gamma- 
carboxyglutamate protein. 

48. The method according to claim 47 
wherein said protein is Factor IX. 

49. The method according to claim 4 3 
wherein said protein is vWF. 
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50. The method according to claim 43 
wherein said host cell is a evikaryotic cell. 

51. The method according to claim 50 
wherein said host cell is a mammalian cell. 

52. The method according to claim 51 
wherein said host cell is a Chinese Hamster Ovary cell. 
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Figure lA 

ATG GAG CTC AGG CCC TGG TTC CTA TGG GTC GTA CCA CCA 3 9 

Met Glu Leu Arg Pro Trp Leu Leu Trp Val Val Ala Ala 

1 5 10 

ACA GGA ACC TTG GTC CTG CTA GCA GCT GAT GCT CAG GGC 78 

Thr Gly Thr Leu Val Leu Leu Ala Ala Asp Ala Gin Gly 

15 20 25 

CAG AAG GTC TTC ACC AAC ACG TGG GCT GTG CGC ATC CCT 117 
Gin Lys Val Phe Thr Asn Thr Trp Ala Val Arg lie Pro 
30 35 

GGA GGC CCA GCG GTG GCC AAC AGT GTG GCA CGG AAG CAT 156 
Gly Gly Pro Ala Val Ala Asn Ser Val Ala Arg Lys His 
40 45 50 

GGG TTC CTC AAC CTG GGC CAG ATC TTC GGG GAC TAT TAC 195 
Gly Phe Leu Asn Leu Gly Gin lie Phe Gly Asp Tyr Tyr 
55 60 65 

CAC TTC TGG CAT CGA GGA GTG ACG AAG CGG TCC CTG TCG 234 
His Phe Trp His Arg Gly Val Thr Lys Arg Ser Leu Ser 

70 75 

CCT CAC CGC CCG CGG CAC AGC CGG CTG CAG AGG GAG CCT 273 
Pro His Arg Pro Arg His Ser Arg Leu Gin Arg Glu Pro 
80 85 90 

CAA GTA CAG TGG CTG GAA CAG CAG GTG GCA AAG CGA CGG 312 
Gin Val Gin Trp Leu Glu Gin Gin Val Ala Lys Arg Arg 
95 100 

ACT AAA CGG GAC GTG TAC CAG GAG CCC ACA GAC CCC AAG 351 
Thr Lys Arg Asp Val Tyr Gin Glu Pro Thr Asp Pro Lys 
105 110 115 

TTT CCT CAG CAG TGG TAC CTG TCT GGT GTC ACT CAG CGG 3 90 
Phe Pro Gin Gin Trp Tyr Leu Ser Gly Val Thr Gin Arg 
120 125 130 

GAC CTG AAT GTG AAG GCG GCC TGG GCG CAG GGC TAC ACA 4 29 
Asp Leu Asn Val Lys Ala Ala Trp Ala Gin Gly Tyr Thr 

135 140 
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Figure IB 

GGG CAC GGC ATT CTG GTC TCC ATT CTG GAC GAT GGC ATC 468 
Gly His Gly lie Val Val Ser lie Leu Asp Asp Gly lie 
145 150 155 

GAG AAG AAC CAC CCC GAC TTG GCA GGC AAT TAT GAT CCT 507 
Glu Lys Asn His Pro Asp Leu Ala Gly Asn Tyr Asp Pro 
160 165 

GGG GCC AGT TTT CAT GTC AAT GAC CAG GAC CCT GAC CCC 546 
Gly Ala Ser Phe Asp Val Asn Asp Gin Asp Pro Asp Pro 
170 175 180 

CAG CCT CGG TAC ACA CAG ATG AAT GAC AAC AGG CAC GGC 585 
Gin Pro Arg Tyr Thr Gin Met Asn Asp Asn Arg His Gly 
185 190 195 

ACA CGG TGT GCG GGG GAA GTG GOT GCC GTG GCC AAC AAC 624 
Thr Arg Cys Ala Gly Glu Val Ala Ala Val Ala Asn Asn 

200 205 

CGT GTC TGT GGT GTA GGT GTG GCC TAC AAC GCC CGC ATT 663 
Gly Val Cys Gly Val Gly Val Ala Tyr Asn Ala Arg lie 
210 215 220 

GGA GGG GTC CGC ATG CTG GAT GGC GAG GTG ACA GAT GCA 702 
Gly Gly Val Arg Met Leu Asp Gly Glu Val Thr Asp Ala 
225 230 

GTG GAG GCA CGC TCG CTG GGC CTG AAC CCC AAC CAC ATC 741 
Val Glu Ala Arg Ser Leu Gly Leu Asn Pro Asn His lie 
235 240 245 

CAC ATC TAC AGT GCC AGC TGG GGC CCC GAG GAT GAC GGC 780 
His lie Tyr Ser Ala Ser Trp Gly Pro Glu Asp Asp Gly 
250 - 255 260 

AAG ACA GTG GAT GGG CCA GCC CGG CTC GCC GAG GAG GCC 819 
Lys Thr Val Asp Gly Pro Ala Arg Leu Ala Glu Glu Ala 

265 270 

TTC TTC CGT GGG CTT AGC CAG GGC CGA GGG GGG CTG GGC 858 
Phe Phe Arg Gly Val Ser Gin Gly Arg Gly Gly Leu Gly 
275 280 285 
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TCC ATC TTT GTC TGG 
Ser lie Phe Val Trp 
290 

CAT GAC AGC TGC AAC 
His Asp Ser Cys Asn 
300 

TAG ACG CTG TCC ATC 
Tyr Thr Leu Ser lie 
315 

GTG COG TGG TAG AGC 
Val Pro Trp Tyr Ser 

330 

ACG ACC TAC AGC AGT 
Thr Thr Tyr Ser Ser 
340 

GTG ACG ACT GAC TTG 
Val Thr Thr Asp Leu 
355 

ACG GGC ACC TCA GCC 
Thr Gly Thr Ser Ala 
365 

ATT GCT CTC ACC CTG 
lie Ala Leu Thr Leu 
380 

CGG GAC ATG CAA CAC 
Arg Asp Met Gin His 

395 

GCC CAC CTC AAT GCC 
Ala His Leu Asn Ala 
405 

GGG CGG AAA GTG AGC 
Gly Arg Lys Val Ser 
420 



Figure ic 

GCC TCG GGG AAC GGG 
Ala Ser Gly Asn Gly 
295 

TGC GAC GGC TAC ACC 
Cys .Asp Gly Tyr Thr 

305 

AGC AGC GCC ACG CAG 
Ser Ser Ala Thr Gin 
320 

GAG GCC TGC TCG TCC 
Glu Ala Cys Ser Ser 

335 

GGC AAC CAG AAT GAG 
Gly Asn Gin Asn Glu 
345 

CGG CAG AAG TGC ACG 
Arg Gin Lys Cys Thr 
360 

TCT GCC CCC TTA GCA 
Ser Ala Pro Leu Ala 
370 

GAG GCC AAT AAG AAC 
Glu Ala Asn Lys Asn 
385 

CTG GTG GTA CAG ACC 
Leu Val Val Gin Thr 

400 

AAC GAC TGG GCC ACC 
Asn Asp Trp Ala Thr 
410 

CAC TCA TAT GGC TAC 
His Ser Tyr Gly Tyr 



GGG CGG GAA 8 97 
Gly Arg Glu 

AAC AGT ATC 93 6 
Asn Ser lie 

310 

TTT GGC AAC 975 
Phe Gly Asn 
325 

ACA CTG GCC 1014 
Thr Leu Ala 



AAG CAG ATC 1053 
Lys Gin lie 
350 

GAG TCT CAC 1092 
Glu Ser His 



GCC GGC ATC 1131 
Ala Gly He 
375 

CTC ACA TGG 117 0 
Leu Thr Trp . 

390 

TCG AAG CCA 1209 
Ser Lys Pro 



AAT GGT GTG 124 8 
Asn Gly Val 
415 

GGG CTT TTC 128 7 
Gly Leu Leu 



SUBSTITUTE SHEET 



wo 92/09698 



PCr/US91/08?25 



4/17 



GAC GCA GGC GCC ATG 
Asp Ala Gly Ala Met 
430 

ACA GTC GCC CCC CAG 
Thr Val Ala Pro Gin 
445 

ACC GAG CCC AAA GAC 
Thr Glu Pro Lys Asp 

460 

AAC ACC GTG ACC GCG 
Lys Thr Val Thr Ala 
470 

ACT CGG CTG GAG CAC 
Thr Arg Leu Glu His 
485 

TAT AAT CGC CGT GGC 
Tyr Asn Arg Arg Gly 
495 

CCC ATG GGC ACC CGC 
Pro Met Gly Thr Arg 
510 

CAT GAC TAC TCC GCA 
His Asp Tyr Ser Ala 

525 

ATG ACA ACT CAT TCC 
Met Thr Thr His Ser 
535 

TGG GTC CTA GAG ATT 
Trp Val Leu Glu lie 
550 

TAT GGG ACG CTG ACC 
Tyr Gly Thr Leu Thr 
560 



Figure ID 

GTG .GCC CTG GCC CAG 
Val Ala Leu Ala Gin 
435 

CGG AAG TGC ATC ATC 
Arg Lys Cys lie lie 
450 

ATC GGG AAA CGG CTC 
lie Gly Lys Arg Leu 

465 

TCC CTG GGC GAG CCC 
Cys Leu Gly Glu Pro 
475 

GCT CAG GCG CGG CTC 
Ala Gin Ala Arg Leu 
490 

GAC CTG GCC ATC CAC 
Asp Leu Ala lie His 
500 

TCC ACC CTG CTG GCA 
Ser Thr Leu Leu Ala 
515 

GAT GGG TTT AAT GAC 
Asp Gly Phe Asn Asp 

530 

TGG GAT GAC GAT CCC 
Trp Asp Glu Asp Pro 
540 

GAA AAC ACC AGC GAA 
Glu Asn Thr Ser Glu 
555 

AAC TCC ACC CTC GTA 
Lys Phe Thr Leu Val 
565 



AAT TGG ACC 1326 
Asn Trp Thr 
440 

GAC ATC CTC 13 65 
Asp lie Leu 
455 

GAC CTC CGG 1404 
Glu Val Arg 



AAC CAC ATC 1443 
Asn His lie 
480 

ACC CTG TCC 1482 
Thr Leu Ser 



CTG GTC AGC 1521 
Leu Val Ser 
505 

GCC AGG CCA 1560 
Ala Arg Pro 
520 

TGG GCC TTC 1599 
Trp Ala Phe 



TCT GGG GAG 1638 
Ser Gly Glu 
545 

GCC AAC AAC 1677 
Ala Asn Asn 



CTC TAT GGC 1716 
Leu Tyr Gly 
570 



SUBSTITUTE SHEET 



wo 92/09698 



PCT/US9 1/08725 



5/17 



Figure IE 

ACC GCC CCT GAC GGG CTC CCC GTA CCT CCA GAA AGC AGT 1755 
Thr Ala Pro Glu Gly Leu Pro Val Pro Pro Glu Ser Ser 
575 580 585 

GGC TGC AAG ACC CTC ACG TCC AGT CAG GCC TGT GTG GTG 1794 
Gly Cys Lys Thr Leu Thr Ser Ser Gin Ala Cys Val Val 

590 595 

TGC GAG GAA GGC TTC TCC CTC CAC CAG AAG AGC TGT GTC 1833 
Cys Glu Glu Gly Phe Ser Leu His Gin Lys Ser Cys Val 
600 605 610 

CAG CAC TGC CCT CCA GGC TTC GCC CCC CAA GTC CTC GAT 1872 
Gin Asn Cys Pro Pro Gly Phe Ala Pro Gin Val Leu Asp 
615 620 

ACG CAC TAT AGC ACC GAG AAT GAC GTG GAG ACC ATC CGG 1911 
Thr Asn Tyr Ser Thr Glu Asn Asp Val Glu Thr lie Arg 
625 630 635 

GCC AGC GTC TGC GCC CCC TGC CAC GCC TCA TGT GCC ACA 1950 
Ala Ser Val Cys Ala Pro Cys His Ala Ser Cys Ala Thr 
640 645 650 

TGC CAG GGG CCG GCC CTG ACA GAC TGC CTC AGC TGC CCC 198 9 
Cys Gin Gly Pro Ala Leu Thr Asp Cys Leu Ser Cys Pro 

655 660 

AGC CAC GCC TCC TTG GAC CCT GTG GAG CAG ACT TGC TCC 2 02 8 
Ser His Ala Ser Leu Asp Pro Val Glu Gin Thr Cys Ser 
665 670 675 

CGG CAA AGC CAG AGC AGC CGA GAG TCC CCG CCA CAG CAG 2 067 
Arg Gin Ser Gin Ser Ser Arg Glu Ser Pro Pro Gin Gin 
680 685 

CAG CCA CCT CGG CTG CCC CCG GAG GTG GAG GCG GGG CAA 2106 
Gin Pro Pro Arg Leu Pro Pro Glu Val Glu Ala Gly Gin 
690 695 700 

CGG CTG CGG GCA GGG CTG CTG CCC TCA CAC CTG CCT GAG 214 5 
Arg Leu Arg Ala Gly Leu Leu Pro Ser His Leu Pro Glu 
705 710 715 
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FIGURE IF 



GTG GTG GCC GGC CTC AGC TGC GCC TTC ATC GTG CTG GTC 2184 
Val Val Ala Gly Leu Ser Cys Ala Phe lie Val Leu Val 

720 725 

TTC GTC ACT GTC TTC CTG CTC CTG CAG CTG CGC TCT GGC 2223 
Phe Val Thr Val Phe Leu Val Leu Gin Leu Arg Ser Gly 
730 735 740 

TTT AGT TTT CGG GGG GTG AAG GTG TAC ACC ATG GAC CGT 22 62 
Phe Ser Phe Arg Gly Val Lys Val Tyr Thr Met Asp Arg 
745 750 

GGC CTC ATC TCC TAC AAG GGG CTG CCC CCT GAA GCC TGG 23 01 
Gly Leu lie Ser Tyr Lys Gly Leu Pro Pro Glu Ala Trp 
755 760 765 

CAG GAG GAG TGC CCG TCT GAC TCA GAA GAG GAC GAG GGC 2 3 40 
Gin Glu Glu Cys Pro Ser Asp Ser Glu Glu Asp Glu Gly 
770 775 780 

CGG GGC GAG AGG ACC GCC TTT ATC AAA GAC CAG AGC GCC 23 79 
Arg Gly Glu Arg Thr Ala Phe lie Lys Asp Gin Ser Ala 

785 790 

CTC TGA 2385 

Leu End 

794 
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Figure 2A 
GAATTCG GAGATCTACA -391 

GGGCTGCCCC CGCCCGCGCC GGAGCTGGAG -361 

CCCAGGCCGA GCCCTGCCCT GGTCGCCGGC -331 

CGGGCCGAGG CCGCGCCGCC GCGCCTCCCC -301 

GCCTCCGCGC CGTGACGCTG CCGCCGGGCG -271 

CGGGGACCGC GCCGAGCCCA GGCCCCCGCC -241 

GCCGGGCTCT CCGCTCGGCC GAGGGGCGCC -211 

CGAGCCGCCG CGGCGGTCGC CTGGAAAAGT -181 

TTCCCCGCCA GGGCTCCCCA GGGGTCGGCA -151 

CTCTTCACCC TCCCGAGCCC TGCCCGTCTC -121 

GGCCCCATGC CCCCACCAGT CAGCCCCGGG - 91 

CCACAGGCAG TGAGCAGGCA CCTGGGAGCC - 61 

GAGGCCTGTG ACCAGGCCAA GGAGACGGGC - 31 

GCTCCAGGGT CCCAGCCACC TGTCCCCCCC - 1 
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Figure 2B 

ATG GAG CTG AGG CCC TGG TTG CTA TGG GTG GTA GCA GCA 39 
Met Glu Leu Arg Pro Trp Leu Leu Trp Val Val Ala Ala 
15 10 

ACA GGA ACC TTG GTC CTG CTA GCA GCT GAT GCT CAG GGC 78 
Thr Gly Thr Leu Val Leu Leu Ala Ala Asp Ala Gin Gly 
15 20 25 

CAG AAG GTC TTC ACC AAC ACG TGG GCT GTG CGC ATC CCT 117 
Gin Lys Val Phe Thr Asn Thr Trp Ala Val Arg lie Pro 
30 35 

GGA GGC CCA GCG GTG GCC AAC AGT GTG GCA CGG AAG CAT 156 
Gly Gly Pro Ala Val Ala Asn Ser Val Ala Arg Lys His 
40 45 50 

GGG TTC CTC AAC CTG GGC CAG ATC TTC GGG GAC TAT TAG 195 
Gly Phe Leu Asn Leu Gly Gin lie Phe Gly Asp Tyr Tyr 
55 60 65 

CAC TTC TGG CAT CGA GGA GTG ACG AAG CGG TCC CTG TCG 23 4 
His Phe Trp His Arg Gly Val Thr Lys Arg Ser Leu Ser 

70 75 

CCT CAC CGC CCG CGG CAC AGC CGG CTG CAG AGG GAG CCT 273 
Pro His Arg Pro Arg His Ser Arg Leu Gin Arg Glu Pro 
80 85 90 

CAA GTA CAG TGG CTG GAA CAG CAG GTG GCA AAG CGA CGG 312 
Gin Val Gin Trp Leu Glu Gin Gin Val Ala Lys Arg Arg 
95 100 

ACT AAA CGG GAC GTG TAC CAG GAG CCC ACA GAC CCC AAG 3 51 
Thr Lys Arg Asp Val Tyr Gin Glu Pro Thr Asp Pro Lys 
105 110 115 

TTT CCT CAG CAG TGG TAC CTG TCT GGT GTC ACT CAG CGG 390 
Phe Pro Gin Gin Trp Tyr Leu Ser Gly Val Thr Gin Arg 
120 125 130 

GAC CTG AAT GTG AAG GCG GCC TGG GCG CAG GGC TAC ACA 42 9 
Asp Leu Asn Val Lys Ala Ala Trp Ala Gin Gly Tyr Thr 

135 140 
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Figure 2C 



GGG CAC GGC ATT GTG GTC TCC ATT CTG GAC GAT GGC ATC 4 68 
Gly His Gly lie Val Val Ser lie Leu Asp Asp Gly lie 
145 150 155 

GAG AAG AAC CAC CCG GAC TTG GCA GGC AAT TAT GAT CCT 5 07 
Glu Lys Asn His Pro Asp Leu Ala Gly Asn Tyr Asp Pro 
160 165 

GGG GCC AGT TTT GAT GTC AAT GAC CAG GAC CCT GAC CCC 54 6 
Gly Ala Ser Phe Asp Val Asn Asp Gin Asp Pro Asp Pro 
170 175 180 

CAG CCT CGG TAC ACA CAG ATG AAT GAC AAC AGG CAC GGC 585 
Gin Pro Arg Tyr Thr Gin Met Asn Asp Asn Arg His Gly 
185 190 195 

ACA CGG TGT GCG GGG GAA GTG GCT GCG GTG GCC AAC AAC 624 
Thr Arg Cys Ala Gly Glu Val Ala Ala Val Ala Asn Asn 

200 205 

GGT GTC TGT GGT GTA GGT GTG GCC TAC AAC GCC CGC ATT 6 63 
Gly Val Cys Gly Val Gly Val Ala Tyr Asn Ala Arg lie 
210 215 220 

GGA GGG GTG CGC ATG CTG GAT GGC GAG GTG ACA GAT GCA 702 
Gly Gly Val Arg Met Leu Asp Gly Glu Val Thr Asp Ala 
225 230 

GTG GAG GCA CGC TCG CTG GGC CTG AAC CCC AAC CAC ATC 741 
Val Glu Ala Arg Ser Leu Gly Leu Asn Pro Asn His lie 
235 240 245 

CAC ATC TAC AGT GCC AGC TGG GGC CCC GAG GAT GAC GGC 78 0 
His lie Tyr Ser Ala Ser Trp Gly Pro Glu Asp Asp Gly 
250 255 260 

AAG ACA GTG GAT GGG CCA GCC CGC CTC GCC GAG GAG GCC 819 
Lys Thr Val Asp Gly Pro Ala Arg Leu Ala Glu Glu Ala 

265 270 

TTC TTC CGT GGG GTT AGC CAG GGC CGA GGG GGG CTG GGC 8 58 
Phe Phe Arg Gly Val Ser Gin Gly Arg Gly Gly Leu Gly 
275 280 285 
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TCC ATC 
Ser lie 



CAT GAC 
His Asp 
300 

TAG ACG 
Tyr Thr 



GTG CCG 
Val Pro 



ACG ACC 
Thr Thr 
340 

GTG ACG 
Val Thr 



ACG GGC 
Thr Gly 
365 

ATT GCT 
lie Ala 



CGG GAC 
Arg Asp 



GCC CAC 
Ala His 
405 

GGC CGG 
Gly Arg 



TTT GTC 
Phe Val 
290 

AGC TGC 
Ser Cys 



CTG TCC 
Leu Ser 
315 

TGG TAC 
Trp Tyr 



TAC AGC 
Tyr Ser 



ACT GAC 
Thr Asp 
355 

ACC TCA 
Thr Ser 



CTC ACC 
Leu Thr 
380 

ATG CAA 
Met Gin 



CTC AAT 
Leu Asn 



AAA GTG 
Lys Val 
420 



Fig 

TGG GCC 
Trp Ala 



AAC TGC 
Asn Cys 
305 

ATC AGC 
lie Ser 



AGC GAG 
Ser Glu 
330 

AGT GGC 
Ser Gly 



TTG CGG 
Leu Arg 



GCC TCT 
Ala Ser 
370 

CTG GAG 
Leu Glu 



CAC CTG 
His Leu 
395 

GCC AAC 
Ala Asn 



AGC CAC 
Ser His 



re 2D 

TCG GGG 
Ser Gly 



GAC GGC 
Asp Gly 



AGC GCC 
Ser Ala 
320 

GCC TGC 
Ala Cys 



AAC CAG 
Asn Gin 
345 

CAG AAG 
Gin Lys 



GCC CCC 
Ala Pro 



GCC AAT 
Ala Asn 
385 

GTG GTA 
Val Val 



GAC TGG 
Asp Trp 
410 

TCA TAT 
Ser Tyr 



AAC GGG 
Asn Gly 
295 

TAC ACC 
Tyr Thr 



ACG CAG 
Thr Gin 



TCG TCC 
Ser Ser 
335 

AAT GAG 
Asn Glu 



TGC ACG 
Cys Thr 
360 

TTA GCA 
Leu Ala 



AAG AAC 
Lys Asn 



CAG ACC 
Gin Thr 
400 

GCC ACC 
Ala Thr 



GGC TAC 
Gly Tyr 
425 



GGC CGG 
Gly Arg 



AAC AGT 
Asn Ser 
310 

TTT GGC 
Phe Gly 



ACA CTG 
Thr Leu 



AAG CAG 
Lys Gin 
350 

GAG TCT 
Glu Ser 



GCC GGC 
Ala Gly 
375 

CTC ACA 
Leu Thr 



TCG AAG 
Ser Lys 



AAT GGT 
Asn Gly 
415 

GGG CTT 
Gly Leu 



GAA 897 
Glu 



ATC 936 
He 



AAC 975 

Asn 

325 

GCC 1014 
Ala 



ATC 1053 
He 



CAC 1092 
His 



ATC 1131 
He 



TGG 1170 

Trp 

390 

CCA 12 09 
Pro 



GTG 1248 
Val 



TTG 1287 
Leu 
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Figure 2E 

GAC GCA GGC GCC ATG GTG GCC CTG GCC CAG AAT TGG ACC 132 6 
Asp Ala Gly Ala Met Val Ala Leu Ala Gin Asn Trp Thr 
430 435 440 

ACA GTG GCC CCC CAG CGG AAG TGC ATC ATC GAC ATC CTC 13 65 
Thr Val Ala Pro Gin Arg Lys Cys lie lie Asp lie Leu 
445 450 455 

ACC GAG CCC AAA GAC ATC GGG AAA CGG CTC GAG GTG CGG 1404 
Thr Glu Pro Lys Asp lie Gly Lys Arg Leu Glu Val Arg 

460 465 

AAG ACC GTG ACC GCG TGC CTG GGC GAG CCC AAC CAC ATC 1443 
Lys Thr Val Thr Ala Cys Leu Gly Glu Pro Asn His lie 
470 475 480 

ACT CGG CTG GAG CAC GCT CAG GCG CGG CTC ACC CTG TCC 14 82 
Thr Arg Leu Glu His Ala Gin Ala Arg Leu Thr Leu Ser 
485 490 

TAT AAT CGC CGT GGC GAC CTG GCC ATC CAC CTG GTC AGC 1521 
Tyr Asn Arg Arg Gly Asp Leu Ala lie His Leu Val Ser 
495 500 505 

CCC ATG GGC ACC CGC TCC ACC CTG CTG GCA GCC AGG CCA 1560 
Pro Met Gly Thr Arg Ser Thr Leu Leu Ala Ala Arg Pro 
510 515 520 

CAT GAC TAG TCC GCA GAT GGG TTT AAT GAC TGG GCC TTC 1599 
His Asp Tyr Ser Ala Asp Gly Phe Asn Asp Trp Ala Phe 

525 530 

ATG ACA ACT CAT TCC TGG GAT GAG GAT CCC TCT GGC GAG 163 8 
Met Thr Thr His Ser Trp Asp Glu Asp Pro Ser Gly Glu 
535 540 545 

TGG GTC CTA GAG ATT GAA AAC ACC AGC GAA GCC AAC AAC 1677 
Trp Val Leu Glu lie Glu Asn Thr Ser Glu Ala Asn Asn 
550 555 

TAT GGG ACG CTG ACC AAG TTC ACC CTC GTA CTC TAT GGC 1716 
Tyr Gly Thr Leu Thr Lys Phe Thr Leu Val Leu Tyr Gly 
560 565 570 
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Figure 2F 

ACC GCC CCT GAG GGG CTG CCC GTA CCT CCA GAA AGC AGT 1755 

Thr Ala Pro Glu Gly Leu Pro Val Pro Pro Glu Ser Ser 
575 580 585 

GGC TGC AAG ACC CTC ACG TCC AGT CAG GCC TGT GTG GTG 1794 
Gly Cys Lys Thr Leu Thr Ser Ser Gin Ala Cys Val Val 

590 595 

TGC GAG GAA GGC TTC TCC CTG CAC CAG AAG AGC TGT GTC 183 3 
Cys Glu Glu Gly Phe Ser Leu His Gin Lys Ser Cys Val 
600 605 610 

CAG CAC TGC CCT CCA GGC TTC GCC CCC CAA GTC CTC GAT 1872 
Gin His Cys Pro Pro Gly Phe Ala Pro Gin Val Leu Asp 
615 620 

ACG CAC TAT AGC ACC GAG AAT GAC GTG GAG ACC ATC CGG 1911 
Thr His Tyr Ser Thr Glu Asn Asp Val Glu Thr lie Arg 
625 630 635 

GCC AGC GTC TGC GCC CCC TGC CAC GCC TCA TGT GCC ACA 1950 
Ala Ser Val Cys Ala Pro Cys His Ala Ser Cys Ala Thr 
640 645 650 

TGC CAG GGG CCG GCC CTG ACA GAC TGC CTC AGC TGC CCC 1989 
Cys Gin Gly Pro Ala Leu Thr Asp Cys Leu Ser Cys Pro 

655 660 

AGC CAC GCC TCC TTG GAC CCT GTG GAG CAG ACT TGC TCC 2 028 
Ser His Ala Ser Leu Asp Pro Val Glu Gin Thr Cys Ser 
665 670 675 

CGG CAA AGC CAG AGC AGC CGA GAG TCC CCG CCA CAG CAG 2 067 
Arg Gin Ser Gin Ser Ser Arg Glu Ser Pro Pro Gin Gin 
680 685 

CAG CCA CCT CGG CTG CCC CCG GAG GTG GAG GCG GGG CAA 2106 
Gin Pro Pro Arg Leu Pro Pro Glu Val Glu Ala Gly Gin 
690 695 700 

CGG CTG CGG GCA GGG CTG CTG CCC TCA CAC CTG CCT GAG 214 5 
Arg Leu Arg Ala Gly Leu Leu Pro Ser His Leu Pro Glu 
705 710 715 
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Figure 2G 



GTG GTG GCC GGC CTC AGC TGC GCC TTC ATC GTG CTG GTC 2184 
Val Val Ala Gly Leu Ser Cys Ala Phe lie Val Leu Val 

720 725 

TTC GTC ACT GTC TTC CTG GTC CTG CAG CTG CGC TCT GGC 222 3 
Phe Val Thr Val Phe Leu Val Leu Gin Leu Arg Ser Gly 
730 735 740 

TTT AGT TTT CGG GGG GTG AAG GTG TAG ACC ATG GAG CGT 2262 
Phe Ser Phe Arg Gly Val Lys Val Tyr Thr Met Asp Arg 
745 750 

GGC CTC ATC TCC TAC AAG GGG CTG CCC CCT GAA GCC TGG 2301 
Gly Leu lie Ser Tyr Lys Gly Leu Pro Pro Glu Ala Trp 
755 760 765 

CAG GAG GAG TGC CCG TCT GAC TCA GAA GAG GAC GAG GGC 234 0 
Gin Glu Glu Cys Pro Ser Asp Ser Glu Glu Asp Glu Gly 
770 775 780 

CGG GGC GAG AGG ACC GCC TTT ATC AAA GAC CAG AGC GCC 2379 
Arg Gly Glu Arg Thr Ala Phe lie Lys Asp Gin Ser Ala 

785 790 

CTC TGA TGA 23 8 8 

Leu 
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Figure 2H 

GCCCACTGCC CACCCCCTCA AGCCAATCCC 241 8 
CTCCTTGGGC ACTTTTTAAT TCACCAAAGT 2448 
ATTTTTTTAT CTTGGGACTG GGTTTGGACC 2478 
CCAGCTGGGA GGCAAGAGGG GTGGAGACTG 2508 
TTTCCCATCC TACCCTCGGG CCCACCTGGC 2538 
CACCTGAGGT GGGCCCAGGA CCAGCTGGGG 2568 
CGTGGGGAGG GCCGTACCCC ACCCTCAGCA 2598 
CCCCTTCCAT GTGGAGAAAG GAGTGAAACC 2628 
TTTAGGGCAG CTTGCCCCGG CCCCGGCCCC 2658 
AGCCAGAGTT CCTGCGGAGT GAAGAGGGGC 2688 
AGCCCTTGCT TGTTGGGATT CCTGACCC AG 27 1 8 
GCCGCAGCTC TTGCCCTTCC CTGTCCCTCT 2748 
AAAGCAATAA TGGTCCCATC CAGGCAGTCG 2778 
GGGGCTGGCC TAGGAGATAT CTGAGGGAGG 2808 
AGGCCACCTC TCCAAGGGCT TCTGCACCCT 2838 
CCACCCTGTC CCCCAGCTCT GGTGAGTCTT 2868 
GGCGGCAGCA GCCATCATAG GAAGGGACCA 2898 
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Figure 21 

AGGCAAGGCA GGTGCCTCCA GGTGTGCACG 2928 
TGGCATGTGG CCTGTGGCCT GTGTCCCATG 2958 
ACCCACCCCT GTGCTCCGTG CCTCCACCAC 2988 
CACTGGCCAC CAGGCTGGCG CAGCCAAGGC 3018 
CGAAGCTCTG GCTGAACCCT GTGCTGGTGT 3048 
CCTGACCACC CTCCCCTCTC TTGCACCCGC 3078 
CTCTCCCGTC AGGGCCCAAG TCCCTGTTTT 3108 
CTGAGCCCGG GCTGCCTGGG CTGTTGGCAC 3138 
TCACAGACCT GGAGCCCCTG GGTGGGTGGT 3168 
GGGGAGGGGC GCTGGCCCAG CCGGCCTCTC 3198 
TGGCCTCCCA CCCGATGCTG CTTTCCCCTG 3228 
TGGGGATCTC AGGGGCTGTT TGAGGATATA 3258 
TTTTCACTTT GTGATTATTT CACTTTAGAT 3288 
GCTGATGATT TGTTTTTGTA TTTTTAATGG 3318 
GGGTAGCAGC TGGACTACCC ACGTTCTCAC 3348 
ACCCACCGTC CGCCCTGCTC CTCCCTGGCT 3378 
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Figure 2 J 

GCCCTGGCCC TGAGGTGTGG GGGCTGCAGC 3408 
ATGTTGCTGA GGAGTGAGGA ATAGTTGAGC 3438 
CCCAAGTCCT GAAGAGGCGG GCCAGCCAGG 3468 
CGGGCTCAAG GAAAGGGGGT CCCAGTGGGA 3498 
GGGGCAGGCT GACATCTGTG TTTCAAGTGG 3528 
GGCTCGCCAT GCCGGGGGTT CATAGGTCAC 3558 
TGGCTCTCCA AGTGCCAGAG GTGGGCAGGT 3588 
GGTGGCACTG AGCCCCCCCA ACACTGTGCC 3618 
CTGGTGGAGA AAGCACTGAC CTGTCATGCC 3648 
CCCCTCAAAC CTCCTCTTCT GACGTGCCTT 3678 
TTGCACCCCT CCCATTAGGA CAATCAGTCC 3708 
CCTCCCATCT GGGAGTCCCC TTTTCTTTTC 3738 
TACCCTAGCC ATTCCTGGTA CCCAGCCATC 3768 
TGCCCAGGGG TGCCCCCTCC TCTCCCATCC 3798 
CCCTGCCCTC GTGGCCAGCC CGGCTGGTTT 3828 
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Figure 2K 

TGTAAGATAC TGGGTTGGTG CACAGTGATT 3858 
TTTTTCTTGT AATTTAAACA GGCCCAGCAT 3888 
TGCTGGTTCT ATTTAATGGA CATGAGATAA 3918 
TGTTAGAGGT TTTAAAGTGA TTAAACGTGC 3948 
AGACTATGCA AACCAAAAAA AAAAAAAAAA 3978 
ACCGTCGACA AAGCGGCCGC 3998 
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